Gene Francci3_2939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2939 
Symbol 
ID3903754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3469006 
End bp3471255 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content72% 
IMG OID637880260 
ProductTrkA-N 
Protein accessionYP_482026 
Protein GI86741626 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0569] K+ transport systems, NAD-binding component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0858519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGG GTTCTGGAAG TCCCGGCGTG ACCGGAGGCA ATGGCACCGT CAACGTTCAT 
GGTGCCCTCG ATGGGCATGG CACCGTCAAC GTTCATGGTG CCCTCAGCGG GCATGGCGCT
GTCAACGGTC AGGGCGGCTC GGGAGGTTCG GGCGCCTTCG GTGGTCTCGG TGGTTCGGGT
GGTTCCGACT CGTCGGGTCA CGTCATTGTC TGCGGGCTGA GCAACCTGGG GCTCCGGCTG
GCCGAACAGC TCCGGGCGTC GGGTGTGACG GTGGTGGCCA TCGACGATCG GGTCGCCCCC
GCCACCCGGC GCAGGGTGGA GCGGTGGGGC GTGCGGGTCC TGGTGGAGAG TACCCGTGGC
GCCGGGGCGC TGCGCGAGGC CGGCATCGCC GCGGCGCTCG CGGTCGTGTC CTGTCACGAG
ACGGACCTCG ACAACCTGGG CACGGCGCTG GTCGCGGCCG AGACGGCCCC CGGTGTCCGG
CTCGTCGTGG CCATCAGTAA CCGCCAGCTC GGCGACCAGC TCGCGGACGC GCTCAGCCAG
GTCCGGGTGC TGAACCGGGC CGAGCTCGCG GGGCCGAGCT TCGTCGAGGC GTGCCTCCGC
TCGGACGTGA CGCACGCCTT CCCGATGGAC GAACGGTCCG ATGCCGAGGT CTTCGCGGTG
ATCGAGGAGT CGATCATAGG CCGGCATGCC TTCCGGGCCC GCTATGGCGA TCTGACCCCG
ATCTCGCTGC GCCGCGCGGG GGAGCGCCTC CCCGAGCTCT GCCCGCCCCG CGACGTCTCG
CTGCGCCCCG GCGACCGGCT GACGCTCCTC GGGCGGCTGT CGGAGTTCCA CGAGCGGGGC
ATGACGGTCG CGGGCCTGCA CGACGCGCGG CTGTTCGCCG CGCTGGCCGG CGGCTCGGCC
GAGCATGGCG ATCCCGGCGA TCCCCGCGAG CCCGGAGGGG TCCGGGGGGT CTGGCGGGCC
ACGGTGGCCA GGTTCCGGGA GATCGTCTCG ATGATCCGTG GGGAGCTCGA CCCCCCCTTC
CGGTTCGCAC TCGCCGCCGT AGTAACGATC ATGATGGTCG GTACGGTCGT GCTGTGGTTG
ACCTACGCCA ACCACAACGA GGCCGCACCC GCCGAGTTCG GCCCGCTGGA CGCGCTTTAC
CTCACGGTCG CGACGATGGC CACGGTCGGC TACGGCGACT TCAACTTCGG CGCCGCCGAC
GAGTGGCTAC AGGTGTTCGG CATCGGACTG ATGCTGCTCG GCGCCCTGTC GATCGCCGTC
GTCTATGCGT TCATTACCAA CATCATCATC AGCCGTCGGT TGGAGCGGGT GATGGGACGC
GGGCGGGCCG GGGCGGTGCG TGGCCACGTC ATTCTATGCA GGCTCGGCTC GGTGGGGGTC
GCCACGATGA ACGGCCTGGT GCGCGCGGGC CGGCATGTGG TGGTGATCGA GCGTGACGAG
AACAACCGGT ACCTGCCCGT CGCCCGTGAA CGTGGGGTGC CCGTGATCAT CGGGGACGCC
ACCGTCCGCT CGACGCTGCT GGAGGCCGGC CTCGCCCATG CCGCCACGAT CGCGGTGCTC
ACCAGCGACG ACGTCGCGAA CCTGGAGGCC GTGCTGTCGG CCCGGGAAGC GCACGAGGAG
TTGCGGGGGG CGTGGGCGGC CCGCCGCCCC GCCCGGCGGG CCGCCCGGCG GGGCGGCGGG
TGGTCCCGCG GCCGGATGCG GGAGTCCGAC CCGGACACGC ACCACCCCGA CCTGCGAGTA
GTGCTGCGGA TCTTCGACAC CACGATGGCC GACGAGGTCG AGCGGCGCTT CGGCATCCAT
ACGGCACGCA GCGCGTCCGC CCTGGCCACG CCGTATTTCG TCGGCGCGGC GCTCGGTTAC
GACGTCATCA GTACCTTCTA CGTGCAGCGC ACGCCGTTCC TGGTGGCCAG GATGACCATC
CACGCCGGCG GCGGGCTGGT CGGCCCGACC CTGCGGGAGC TGTCCACCGG GACCCGGGTG
CTCGCCGTGA TCACCGCGGC CGCGAACGCG GACACCGGCG GAGACGTGAA TCTGGACAGC
GGCGTGAATC TGGACAGCGG CGCGGACACC GACGGCGGCG TGGAGCGGGA CGGCCAGGGG
GCTCCCGGTG GCGGGCCGGA CTACCGGCCG GGACGCCACA CCCGGCTCCG GCCCGGGGAC
GAGCTGTTCG TGGTGGGACC GGTGAACCGG ATCGTCGACA TGGTGCGGCG CAACCAGCAG
GTCGACATCA CCACGGCGGG AACTGCGTAA
 
Protein sequence
MAAGSGSPGV TGGNGTVNVH GALDGHGTVN VHGALSGHGA VNGQGGSGGS GAFGGLGGSG 
GSDSSGHVIV CGLSNLGLRL AEQLRASGVT VVAIDDRVAP ATRRRVERWG VRVLVESTRG
AGALREAGIA AALAVVSCHE TDLDNLGTAL VAAETAPGVR LVVAISNRQL GDQLADALSQ
VRVLNRAELA GPSFVEACLR SDVTHAFPMD ERSDAEVFAV IEESIIGRHA FRARYGDLTP
ISLRRAGERL PELCPPRDVS LRPGDRLTLL GRLSEFHERG MTVAGLHDAR LFAALAGGSA
EHGDPGDPRE PGGVRGVWRA TVARFREIVS MIRGELDPPF RFALAAVVTI MMVGTVVLWL
TYANHNEAAP AEFGPLDALY LTVATMATVG YGDFNFGAAD EWLQVFGIGL MLLGALSIAV
VYAFITNIII SRRLERVMGR GRAGAVRGHV ILCRLGSVGV ATMNGLVRAG RHVVVIERDE
NNRYLPVARE RGVPVIIGDA TVRSTLLEAG LAHAATIAVL TSDDVANLEA VLSAREAHEE
LRGAWAARRP ARRAARRGGG WSRGRMRESD PDTHHPDLRV VLRIFDTTMA DEVERRFGIH
TARSASALAT PYFVGAALGY DVISTFYVQR TPFLVARMTI HAGGGLVGPT LRELSTGTRV
LAVITAAANA DTGGDVNLDS GVNLDSGADT DGGVERDGQG APGGGPDYRP GRHTRLRPGD
ELFVVGPVNR IVDMVRRNQQ VDITTAGTA