Gene Francci3_2948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2948 
Symbol 
ID3903763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3482058 
End bp3483479 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content71% 
IMG OID637880269 
ProductNAD(P) transhydrogenase, beta subunit 
Protein accessionYP_482035 
Protein GI86741635 
COG category[C] Energy production and conversion 
COG ID[COG1282] NAD/NADP transhydrogenase beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.703596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.15563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCT CGGACACGGC GATCCGCCTG CTGTTCCTGG CCGCGGCGAG CTGCTTCGTC 
CTCGGCCTGC ACCTGATGAA CTCGCCCGCG ACCGCCCGCC GGGGCAACCA GCTGTCCGCG
GCCGGGATGG TGGCGGCCGT CGGGACCGCC GTCGGGCTGC TGGCCCGGGA CGGCGTCATC
ACCGCTACCG GCCTGATCGT CCTGATCGCC GGTGTGCTCC TGGGATCCGG CGCGGGCCTG
TATGCGGCGC GGACCGTGCG GATGACCGCG ATGCCGCAGC TGGTCAGCGT GTTCAACGCG
GTCGGCGGCG GCGCGGCGGC GCTCGTCGCG TTCGCCGAGG TGGCGCGGTT GGGCGGCGCG
GCCGGGCTGG CCGGGGCCTC CGGTCAGACC ACCGTGACCG TCGGCCTCGA CGTCGTCATC
GGCGCGGTGA CCTTCTCCGG CTCGCTGGTG GCCGCCGGCA AGCTGCAGGG CCTGATCAGT
GGCGCGCCGG TGCTGTTCCG CGGCGCCCGA CTACTGAACG CCGCGCTGGC CGCGGTCGCC
GTGACCGGCG TGGCGGCGAT CGTCGCTGGT CACGGTGGGA CGATCGCGCT GTCGATCGTC
GCGGCCGCCG CGCTGGGCTT AGGCGTGACG ATGGTGCTGC CGATCGGCGG CGCGGACATG
CCGGTGGTGA TCTCGCTGCT GAACGCGTTC ACCGGGACGG CCGTGGCGAT GGCCGGTTTC
GTGCTCGACA GCACCGTTCT CGTTATCGCG GGCGCGCTGG TGGGGGCGTC CGGCTCCATC
CTCACCGTGC TGATGGCCGA GGCGATGAAC CGGTCGATCC CGGCCATCAT CGTCGGTGGT
TTCGGCACCG CGGACTCGAC CGGAAACGCC GGCGCGGCGG CCTCGGCCCC AGTGCGGTCG
GTCGCGGCGG ACGACGCGGC CCTGCAGCTG GCGTACGCGT CGAAGGTCAT TATCGTGCCC
GGCTACGGGC TTGCGGCGGC GCGGGCCCAG CATGAAGTAG TCGAGCTCGC CGACCTGCTC
ACGCAGCACG GGGTGGACGT CTGCTACGCG ATCCACCCGG TCGCCGGCAG AATGCCGGGC
CACATGAACG TGCTGCTGGC CGAGGCGCAC GTCCCCTATC CGCAGATGAA GGAACTGGCC
GAGGTCAACC CCGAGTTCGC GCAGGCCGAT GTCGCCCTGG TCGTCGGCGC CAACGACGTG
ACCAACCCGG CGGCCCGCCG TCCCGGCAAC GCGATCTCCG GTATGCCGAT CCTCGACGTC
GACCAGGCGA AGAGCGTCAT CGTCATCAAA CGCTCCATGG GCCACGGCTA CGCCGGCATC
GACAACGAGC TGTACACCGA CCCGAAGACC GGAATGCTCT TCAACGACGC GAAAGCCGGC
CTGAGCGCGC TCACCGCCGC CATGAAGACA CTCGTGCACT GA
 
Protein sequence
MSGSDTAIRL LFLAAASCFV LGLHLMNSPA TARRGNQLSA AGMVAAVGTA VGLLARDGVI 
TATGLIVLIA GVLLGSGAGL YAARTVRMTA MPQLVSVFNA VGGGAAALVA FAEVARLGGA
AGLAGASGQT TVTVGLDVVI GAVTFSGSLV AAGKLQGLIS GAPVLFRGAR LLNAALAAVA
VTGVAAIVAG HGGTIALSIV AAAALGLGVT MVLPIGGADM PVVISLLNAF TGTAVAMAGF
VLDSTVLVIA GALVGASGSI LTVLMAEAMN RSIPAIIVGG FGTADSTGNA GAAASAPVRS
VAADDAALQL AYASKVIIVP GYGLAAARAQ HEVVELADLL TQHGVDVCYA IHPVAGRMPG
HMNVLLAEAH VPYPQMKELA EVNPEFAQAD VALVVGANDV TNPAARRPGN AISGMPILDV
DQAKSVIVIK RSMGHGYAGI DNELYTDPKT GMLFNDAKAG LSALTAAMKT LVH