Gene Francci3_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2007 
Symbol 
ID3906723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2356343 
End bp2359210 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content68% 
IMG OID637879343 
ProductFAD linked oxidase-like 
Protein accessionYP_481110 
Protein GI86740710 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase
[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTTG AGGGGCGTCC CACCGGCGCG GAGGTAGCGG GGAGCCGCGG CGGACCTGGC 
CAGGGGCCGG CCGTGGCAGG CAGCCCGGTC CTGCGCTACG CGATGGCGCG GGATGCGTCG
CACTATCACC TGGTGCCCTC CGCGGTGGAA CGTGTTGCCG GCGTGGGCGA TGTGGCGGGG
CTGTTTGCGA GATGCCGGCG GAGTGGTTCG TACCTGACGT TCCGCTCGGG CGGAACGAGC
CTCAGCGGCC AGGGCGTCAC CGACGGGATC CTGGTCGACG TGCGACACGG TTTTCAGTCG
GCCGAGGTGC TCGACGGTGG TCACCGTTTG CGCGCCGAGC CTGGCGTGAC CGTCCGTGCG
GCCAACGCGC GGCTGCGTCC GTACGGGCGG AAGCTCGGCC CCGACCCAGC GAGTGAGGTC
GCCTGCACCC TGGGTGGAGT CATTGCCAAT AATTCCAGCG GAATGGCCTG CGGCACCGGG
CAGAACGCCT ACCAGACGCT TGAGGCGATG ACCGTGGTGC TACCCAGCGG ATCCGTGATT
GACACCGGTG CGCCGGACGC GGATGAACGG CTGCGCGCAC TGGAACCCGA TCTGTATGCG
GGCTTGCTGC GGCTCCGCGA GCGGATCTGC CGCAACCCGG CGTCCGTGGC GACGCTGCGT
CGGCAGTTCT CAATGAAGAA CACGATGGGC TACAGTCTCA ACTCGTTTCT CGACTACGAG
CGTCCCGTCG AGATCCTGGC CCATCTGATG GTCGGTAGCG AAGGCACGCT CGGGTTCGTC
GCCTCGGCCA CGTTCCGGAC GGTGGAGCTC TTTCCCCACG CCTCGACCGG CTTGGCGGTG
TTTCGTGACC TCGCCACCGC GACCGCCGCG TTGCCCGAGC TGGTCGACGT GGGGTTCGCG
ACGATCGAAC TGATGGACGC TCGGTCCCTT GCCGTTGCGC AGACCCTCGG CGCGACACCG
GCTGAGATCG GCTCGCTGGA GGTGCGTGAC CATGTTGCGC TCCTGGTGGA GCTGCAGGCA
GCGACCACCG AGGAGCTTGC CGACAAGGTG TCCGGGGCCG GCGGCGTGTG CGATGGGCTC
GAGATCGAGT CGCCCCTGGA GTTGACCTCG GACCCGTGGC GCCGAAAGGA CCTGTGGCAT
GTGCGCAAGG GTCTCTATAC GGCCGTCGCG GGAGCGCGCC AGGCGGGGAC CACGGCGTTG
CTGGAGGATG TGGCGGTTCC GGTGCCGCAC CTGCGGGCAG CGTGCCAGGA ACTCACGAAG
CTGTTCGACG CGCACGGCTA TCAGAACAGT GTCATCTTCG GCCATGCCAA GGACGGCAAT
ATCCATTTTA TGCTCACCGA GACCTTCCGG GATCCCGCGC GGTTGGAACG CTACCACGCA
TTCACCGAGA AAATGGTTGA GCTGGTGCTC GAGCACAAGG GGACGCTCAA GGCGGAGCAT
GGCACCGGCC GGATCATGGC GGGGTATGTC CGTCGCCAGT ACGGCGATGA ACTGTATGAC
GTCATGACGG AGGTGAAGCG GCTGTTCGAC CCGCTGGGAA TCCTCAACCC GGGGGTCGTG
CTGTCCGACG ATCCTCGCTC CTATCTGCGC AATCTCAAGG ATGTGCCGAC GGTCGGCTAC
GGCGCGGACA TGTGCGTGGA GTGCGGGTAC TGCGAACCGG TCTGCCCGAG CCGGACGCTG
ACCCTGACTC CTCGGCAGCG GATCGCCCTG CTGCGGGAGC GTGAGGCCGC GCGGCGGGAG
GGAGACGAGG GGCTCGCCGA TGAGCTGTCC GCGGCCTACC GCTATGACGT GGTCGACACC
TGCGCGGTCG ACGGTATGTG CCAGACCGCT TGCCCTGTGC AGATCAACAC CGGATCACTC
GTGCGAGAGC TGCGCGCCGA GAGGGTGAAC AAGGCCGAGG ACGCGCTGTG GCGCTCGGCG
GCCCGCCACT GGGGGGCGAC CACCACACTG GCTGGGAAGG CGCTTTCTGC CGCCGCCGCA
CTGCCGCCGA CGTTGCCGAC AGCCGCGGCC TCCCTGGCCC GCAGAACGTT GGGCACCGAC
AGGATGCCTC AGTATGACGC CTCGCTGCCC CGCGGTGGGT ACCGGCGCCG GGCGGTCGCG
GCTGCCGCAG AGGCGTGCGC CGTGTACTTC CCGGCGTGCG TCGGGGCCAT GTTCGGCTCG
TCGTCATCCA GCGGCGGGGT CATGCCCGCG ATGCTGACGC TGTGCGCGCG TGCCGGGGTA
GCGGTACGGG TACCCAGGGG TATCGCGTCG ATGTGTTGCG GTATGCCGTG GAAGTCGAAG
GGTCTCAGGG GTGGCCACGA GGTCATCGGG GCCAAGGTCC TGCCGGCGCT GCTCGCGGCG
ACTGACGGCG GCCGCCTGCC GGTCGTGTGC GACGCGGCGT CATGTACAGA AGGTCTGGAG
GAGCTTCGCG CCGAGGCAAA GCGGCTTGGC GGCGCCTACG AGGCACTTCG TTTCGTGGAC
GCGCTCGAAT TCGTGCGCGC CGAAGTCGTG GGCCGCCTCT CGGTGACCCG CCGGGTGGCG
TCCCTGGTAC TGCACCCTAC GTGCTCGACC GAGCGGCGGG GCACCACGAC TTTACTCAGG
GAACTCGCCG AGCTGGTCAG CGACGAGGTG GTCGTGCCGC TGGACTGGAA TTGCTGCGCG
TTCGCCGGTG ATCGCGGACT TCTGCATCCC GAGCTGACTG CGGCGGCGAC GCTGAACGAG
GCACGTGAGG TCAACTCCCG CGCCTTCGAG GTGCATGCGT CGGCCAACCG GACCTGCGAG
ATCGGAATGT CACGCGCAAC GGGGCGCGAA TACGTCCACA TTGTCGAGGC GCTGGAGTAC
GCGACTCGCC CGATCCGTGA TTCTGCCCAT CCAGGCGGTG CCGGCTGA
 
Protein sequence
MSLEGRPTGA EVAGSRGGPG QGPAVAGSPV LRYAMARDAS HYHLVPSAVE RVAGVGDVAG 
LFARCRRSGS YLTFRSGGTS LSGQGVTDGI LVDVRHGFQS AEVLDGGHRL RAEPGVTVRA
ANARLRPYGR KLGPDPASEV ACTLGGVIAN NSSGMACGTG QNAYQTLEAM TVVLPSGSVI
DTGAPDADER LRALEPDLYA GLLRLRERIC RNPASVATLR RQFSMKNTMG YSLNSFLDYE
RPVEILAHLM VGSEGTLGFV ASATFRTVEL FPHASTGLAV FRDLATATAA LPELVDVGFA
TIELMDARSL AVAQTLGATP AEIGSLEVRD HVALLVELQA ATTEELADKV SGAGGVCDGL
EIESPLELTS DPWRRKDLWH VRKGLYTAVA GARQAGTTAL LEDVAVPVPH LRAACQELTK
LFDAHGYQNS VIFGHAKDGN IHFMLTETFR DPARLERYHA FTEKMVELVL EHKGTLKAEH
GTGRIMAGYV RRQYGDELYD VMTEVKRLFD PLGILNPGVV LSDDPRSYLR NLKDVPTVGY
GADMCVECGY CEPVCPSRTL TLTPRQRIAL LREREAARRE GDEGLADELS AAYRYDVVDT
CAVDGMCQTA CPVQINTGSL VRELRAERVN KAEDALWRSA ARHWGATTTL AGKALSAAAA
LPPTLPTAAA SLARRTLGTD RMPQYDASLP RGGYRRRAVA AAAEACAVYF PACVGAMFGS
SSSSGGVMPA MLTLCARAGV AVRVPRGIAS MCCGMPWKSK GLRGGHEVIG AKVLPALLAA
TDGGRLPVVC DAASCTEGLE ELRAEAKRLG GAYEALRFVD ALEFVRAEVV GRLSVTRRVA
SLVLHPTCST ERRGTTTLLR ELAELVSDEV VVPLDWNCCA FAGDRGLLHP ELTAAATLNE
AREVNSRAFE VHASANRTCE IGMSRATGRE YVHIVEALEY ATRPIRDSAH PGGAG