Gene Francci3_0989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0989 
Symbol 
ID3905845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1170406 
End bp1173576 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content70% 
IMG OID637878322 
Productcarbamoyl-phosphate synthase L chain, ATP-binding 
Protein accessionYP_480101 
Protein GI86739701 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCCGCT GCGTCGCCAT CGTCAACCGT GGTGAGGCCG CGATGCGCCT CATCCGAGCC 
GTTCGGGAGA TAGTCGCCGA GACCGCGACA GCGATCGAGA CCGTCGCCCT GTACACCGAC
GTCGACCGCA CCGCGACGTT CGTGCGCGAG GCCGACAGGG CCTACTGTCT TGGCCCGGCC
GCCGCGCGTC CCTATCTCGA TCTGAGGGTC CTAGAGCGCG CGCTGGTGGA GACTGGGGCC
GACGCCGCGT GGGTCGGCTG GGGCTTCGTC GCGGAGGACC CGGCGTTCGA AGAGCTTTGC
GAGAGGCTCG GGGTCACCTT CATCGGGCCT AGTTCCGAGG CCATGCGTAA GCTCGGCGAC
AAGATCGGCG CGAAGCGGAT CGCCGAGGAG GTCGGCGTGC CGGTCGCGCC GTGGAGCCGC
GGTCCGGTCG AGAGCCTGGA CGCCGCCCTG GCAGCGGCTG CCAAGATCGG TTATCCGCTT
ATGCTCAAGG CGGCCGCGGG CGGCGGCGGT CGGGGCATCC GTGTGATCAG GGACGAGTCC
GAGCTTGTCA ACGCCTTCGA GCGCACCAGG CAAGAGGCCG CGCGGGCGTT CGGCATCGAT
GTCATGTTCC TGGAGCGCCT AATTACTGGG GCCCGGCACG TCGAGGTCCA GGTGATTGCC
GACGGCCAGG GCACCGCGTG GGCGCTCGGC GTCCGTGACT GCTCGGTGCA GCGGCGCAAC
CAGAAGGTAA TCGAGGAGTC GGCCTCGCCG GTGCTCAGCC CGGAGCAGGC GGCCAACCTC
AAGGCCGCGG CCGAGCGGCT GATCGTCGCG GTCGGCTACC GGGGCGCGGC GACCGTCGAG
TTCCTCTACC ATCCCGGCGA CAAGCAGGTC ACGTTTCTCG AGGTCAACAC CCGCCTGCAG
GTCGAGCACC CGATCACCGA GGCCACAACC GGGTTCGACC TGGTCAAGGC CCAACTACAC
GTGGCCTCGG CCGGCCGGCT GCAGGGCGTG CGGCCGGTCG AGCGCGGACA CGCCGTCGAG
GCCCGGCTCA ACGCGGAGGA CCCCGACCGC GACTTCATGC CGTGCCCGGG CCACATCACG
CGGCTGGAGC TGCCAGCCGG ACCCGGCATC CGCGTCGACA CTGGCTTCAG CGAGGGCGAC
ACCATCCCGG CCGACTTCGA CTCGATGATC GTGAAGATCA TCGCCTACGG CCGCGATCGC
GACGGGGCGC TGGGCCGGCT GCGCCGGGCG ATTCAGGAGA CCAAAGTGAT CATCAAGGGT
GGTGTCACGA ACAAGAGCTT CCTGCTGGAC CTGCTCAACC GGCCCGAGCT GATCGACGCC
AGCGCGGACA CCGGCTGGAT CGACCGCGTC GGGGACGGGC TTGTCTCGCA CCGGCACTCC
GCCGTGGCCC TGGTGGCAGC GGCCATCGAC GCGTACGAGG AGAAGGAGCA CGTCGAGCGG
CAGCGGCTGC TGTTGACGGC GTTCGGCGGA CGCCCGCAGG TGCAGCACGA CAGCGCTCGG
CCGCTGGACC TCAAGTTGCG CGACGTCGGC TACCGGGCGC GCGTGGCGCG GCGGGGCCCG
TGCCGGTTCC ACGTCAGCCT CGAAGCGGGC GCCGAGGTCC GCACCGCTGA CGTCAAGATT
GACCGCTTCG ATCGGCACAC CGGGCAGATC GTCGTCAACG GTGCCCGGTA CCGGCTACTC
GCCGACATCC ACGGGCCCGT CCACCTGATC GAGGTGGACG GCGTCACCCA CCGGGTCAGC
CGCGACGAGG GCGACGTCGT CCGCTCGCCC ATGCCTGCCA TGGTTATCGC CACGCCGCTG
GAGGTTGGCG CCGAGATCGA GGCGGGCGCA CAAATCCTGG TGCTGGAGAG CATGAAAATG
GAGACGGTGC TGCAGGCGCC GTTCAAGGCG CGGGTGAGGG AATACTTCGT CTCCGTGGGC
AGCCAGGTAG AGGCGGGTGC GCCGCTGCTG CGGCTGGAGC CGATCGCCGG CGCCGAGGTC
GAGGACAGCC CGGCCGCCGC GGCGGCCGGG CTGGACCTGC CGACCGCGCC CGAGGAGGTC
CCGGCGCGTG AGCGCACCGC GCGCGGCCTG GAGGATCTGC GCGCTTTGCT GCTGGGCTTC
GACCTGCACG ATGAGCGCCA GGTGCTCCAC GACTACCTCG CCGCGCGTCG GGCGGCCACC
GAGGAGGGCC ACCAGCCGCT GGCCAAGGAG CTAGAGATCA TCGAAATATT CGCCGACCTG
GCCGAGCTGA GCCGGAACCG GCCTGCCCGC ATGGACGTCG GCGATGAGGG CCACCTGCGC
AACCCTCACG AGCACTTCCA CACCTACCTG CGCAGCCTCG ACGTCGAGCG GGCCGGGCTA
CCGGAGGCGT TCCAGGCCAA ACTGGCCAAG GCGCTCGGGC ATTACGACGT CACCGACCTG
AAACGCTCCC CCGAGCTCGA GGCCGCGGTG TTCCGGATCT TCCTCGCCCG GCAGCCCGCC
ACCGCCAAGG TGGCCGCCAA GGTAGTTGCG GCGCTGCTGG AATCGTGGCG GTGGGAGCCG
CCGCCGGACA AGTCCGTGCA CAAACCGGTG CGTCTGGCGC TGGAGCGGCT GGTGGCCGCC
ACGAAGGGCC ACTTCCCCGT GGTTGCCGAC CTCGCGCTCG ACGTGATGTT CGCCTGGTTC
CAGCATCAGC AGTTGCTGCG CCGTGCCCAG GAGCGCATCC GTATCTTCGA GCACCTGCTC
CACCTGGACG CCTACCCGGA CGCGCCGGAT CGCACCGAGC GCATCGCCGA GATGGTGCGC
AGCACCGAGC CGCTGGTGCC GCTGCTGGGC CAGCGGCTGC TGCGCGGCGA CCGGGATAAC
ACGGTCATAC TGGAAGTGCT GACCCGCCGG TACTACGGCA ACAAGGACCT CACCCACATC
CGCGCCACCG AGTACGCCGG CTACCGGTTC TGGGTGGCCG AGCGCGCGGG CTCCAGGCTC
GTCTCCTGCG CAGTTGGCTT CGACGCGCTG GACGCGGCGC TGGGTGGGCT TGCGGAGCTG
GCAGGCGGCG AGCGCTCCGT CGATGCTGAC ATCTACCTGT CCTGGGAGAA CCAGCCGGCG
GACTCCGACG TGATGGCGGC CGCGCTGGGC GAGGCCGCCA GCGCGCACCC GCTGCCGCCC
CAGGTGCGTC GGCTCACCAT CACCATCACC CCGGAGGCAG CCGCACGATG A
 
Protein sequence
MFRCVAIVNR GEAAMRLIRA VREIVAETAT AIETVALYTD VDRTATFVRE ADRAYCLGPA 
AARPYLDLRV LERALVETGA DAAWVGWGFV AEDPAFEELC ERLGVTFIGP SSEAMRKLGD
KIGAKRIAEE VGVPVAPWSR GPVESLDAAL AAAAKIGYPL MLKAAAGGGG RGIRVIRDES
ELVNAFERTR QEAARAFGID VMFLERLITG ARHVEVQVIA DGQGTAWALG VRDCSVQRRN
QKVIEESASP VLSPEQAANL KAAAERLIVA VGYRGAATVE FLYHPGDKQV TFLEVNTRLQ
VEHPITEATT GFDLVKAQLH VASAGRLQGV RPVERGHAVE ARLNAEDPDR DFMPCPGHIT
RLELPAGPGI RVDTGFSEGD TIPADFDSMI VKIIAYGRDR DGALGRLRRA IQETKVIIKG
GVTNKSFLLD LLNRPELIDA SADTGWIDRV GDGLVSHRHS AVALVAAAID AYEEKEHVER
QRLLLTAFGG RPQVQHDSAR PLDLKLRDVG YRARVARRGP CRFHVSLEAG AEVRTADVKI
DRFDRHTGQI VVNGARYRLL ADIHGPVHLI EVDGVTHRVS RDEGDVVRSP MPAMVIATPL
EVGAEIEAGA QILVLESMKM ETVLQAPFKA RVREYFVSVG SQVEAGAPLL RLEPIAGAEV
EDSPAAAAAG LDLPTAPEEV PARERTARGL EDLRALLLGF DLHDERQVLH DYLAARRAAT
EEGHQPLAKE LEIIEIFADL AELSRNRPAR MDVGDEGHLR NPHEHFHTYL RSLDVERAGL
PEAFQAKLAK ALGHYDVTDL KRSPELEAAV FRIFLARQPA TAKVAAKVVA ALLESWRWEP
PPDKSVHKPV RLALERLVAA TKGHFPVVAD LALDVMFAWF QHQQLLRRAQ ERIRIFEHLL
HLDAYPDAPD RTERIAEMVR STEPLVPLLG QRLLRGDRDN TVILEVLTRR YYGNKDLTHI
RATEYAGYRF WVAERAGSRL VSCAVGFDAL DAALGGLAEL AGGERSVDAD IYLSWENQPA
DSDVMAAALG EAASAHPLPP QVRRLTITIT PEAAAR