Gene Franean1_3153 details

Gene Information

Locus tag: Franean1_3153
Symbol:
ID: 5671530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3710804
End bp: 3713215
Gene Length: 2412 bp
Protein Length: 803 aa
Translation table: 11
GC content: 68%
IMG OID: 641242048
Product: ABC transporter related
Protein accession: YP_001507468
Protein GI: 158314960
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0410] ABC-type branched-chain amino acid transport systems, ATPase component
TIGRFAM ID:
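
The start, end, gene length and protein length values in this record are mutually consistent. The following is a minimal sketch of that arithmetic, assuming inclusive 1-based coordinates and a CDS whose final codon is a stop codon (the gene sequence on this page ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3710804, 3713215)
print(length)                  # 2412
print(protein_length(length))  # 803
```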


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACCAG GAGCGACTCA TCAATGCCCA GATCCTTCGT TATGGCGAGC GCTTCGCCCG 
ACGGGTCGGC TAGCTGCCAT CTTCCGAAAG CAGGCTGACG GTGTCCAAGA GAAACGAATC
CTCGTTATTA CTGATGTATT GGCCGCACCC GTGAACACAC CCCGTGGTGC CGCGACCGGT
GCCGAAGGCG ATGGGGCTAG GGAGACCCGA GGCGGCGCTA CTGGCACTGC GCCGTCGGTT
GCCGGGCTGG CCGCCGGGTT GATCGACGCG GAGACGGAGC GTCGGGCGCA GCAGGCGGAA
TCCCAGCGTG AGGTGCTGTT CGCGGACGAG CTACTCCCCG GTGTCGGCGG GGAGCCACTG
ACTCTGCGCC AGGGGCTGGC GGCGGGCGGT TCGCTGACGT TCCTGACGCT GGTGGTATTG
GCCGCGCTGG ACGAGCTGGA GTCGGCGGCG CTGACGGTGC TGGCGCCAGA CATCCGCGAC
GCGTTCGGGA TCAGCGACAG CGCGATCGTT TTCATCTCGG CGGCGGCCGG AGCCTTCCTC
GTGTTGGGCG CACTACCGAT GGGATGGCTG GCGGACCATC TCCGGCGCAG CCGGCTGATC
GGCTGGGCGG GGGTCGCATT TTCGGTGATG GTGCTCGCCT CGGGCCTCGC GGCGAACGCG
TTTCTGTTCT TCCTGGCCCG GTTCGGCGTC GGCGTGGCGA AGTCAAGCAA CACGGTGCAG
GCCTCACTGC TCGCGGACGC CTATCCCATC GGGGTACGGG GTCGGATCTC GGCGACAACC
TACGGGGCGG CCCGGACCGC GGGCGCGATC AGTCCCCTGC TGGTCGCGGG CATCGCCACC
TGGGTCGGCG GGGACGATGG CTGGCGCTGG CCGTTCCTGA TCCTCGGGCT GCCGGCGCTG
GCCATCGCCG TCGTGGCGTT CCGGCTGCCG GAGCCCCCTC GGGGGCAGCA CGAGATGCGC
TCGGTGCTGG GCGAGGTGGT CGAGGACGCC AAGCCGATGC CGATCTCGGT AGAGGCCGCC
TTCGCGCGGC TGATGCGGAT CCGCTCCGTG AAGACGTCGA TCATTGCCTT CTCGTCGCTG
GGATTCAGCC TGTTCACCAC GGGCATTCTG GCCAATCTTT GGGCCGAGGA CCACTACGGC
ATGTCGACGT TCCAGCGCGG GCTGATGGGA TCCCTCGGCG GGGCCGCCCT GCTGGTCGCC
CTGCCCATCG TCGGGCCCCG GTACGACCGG CTGTACCGCC AGGATCCCGC GCGGGCGGTG
GCGCTGCTCG GCTTCTGCAT CCTGCCGGGC GCGGTCCTGC TACCGGTGCA GTGGTTCATG
CCGCACTGGA TTGGGTTCAT GCTGTTCAGC ATTCCCGGCG CGGTGTTCGC CTCGGTCGCG
TTCGCAATGG TCGGCCCGGT GATGCAGTCC GTCGTGCCCT ACCGGCTTCG CGGCCTTGGG
ATAGCGCTGG CCGCGACCTA CATGTTCTTC GTCGGTGCCA CCGGCGGGGC GATTTTGTCC
GGGCTCATCA GCGACGCCTA CGACCCGCGG GTGGCGGTAC TGGTCATCGG GATCCCGTCA
ACCCTGGTCG GCGGCCTGAT GATGATCCGC AACTCGTCGT TCGTGCGCCA CGACCTGTCG
CTGGTCGTGG CCGACCTGCA CGAGGAGTTG GCCGAGCGCG ACCGGCAGCG CGACGACCCG
GAGAACGTTC CTGTGTTGCA GGTCAACAAC ATCGATTTTT CTTACGGCCA GGTACAGGTG
CTGTTCGACG TCGCCTTCGA CGTACGCAAG GGCGAGACGC TCGCGCTGCT CGGCACGAAC
GGCGCCGGGA AGTCCACCAT CCTCAAGGTC ATCTGTGGGC TGGGCACCCC GTCACGGGGC
GTTGTGCGGC TCAATGGTCG GACGATCACG TACGTCGCCC CGGAACAGCG CGGCAGGTAT
GGCGTACACC TGCTCCCGGG TGGCAAGGGT GTGTTCCCTG CGATGACCGT GCGGGAGAAC
CTGGAGATGG CCGCGTTCCG GATGCGCCGG GACCGCGCCG GCCGCGACCA TCGTTTCGCC
TACGTACTTG ACCTGTTCGA GGATCTGAAG GACCGCCAGT CGCAGCGGGC TGGCTCGCTG
TCCGGCGGGC AGCAGCAGAT GCTCGCACTC GCCATGGTGC TGCTTCACGA CCCCGAGGTG
CTACTGATCG ATGAACTCTC CCTGGGGTTG GCACCGGTGG TCGTCGCAGA CCTCCTGGCG
ATTCTGGAGC GACTCAAGGC CGACGGCCTG ACGATCATCG TGGTGGAACA GTCACTGAAC
ATCGCCCTGG CCATCGCCGA CCGGGCCGTG TTCCTGGAGA AGGGCCAGGT CCGCTTCACC
GGACCAGCCC GGGAGCTGGC CGAACGCGAC GACCTCGCGC GCGCAGTGTT CCTCGGCCGG
GAAGGCGGCT GA
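
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation such as GC content. The following is a minimal sketch, with hypothetical helper names. It is demonstrated only on the first two blocks of the sequence; the record's overall 68% figure is not recomputed here.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide string."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence on this page.
first_blocks = "ATGCGACCAG GAGCGACTCA"
print(gc_fraction(first_blocks))  # 0.6
```

Pasting the full sequence text into `gc_fraction` gives a value to compare against the reported GC content.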
 
Protein sequence
MRPGATHQCP DPSLWRALRP TGRLAAIFRK QADGVQEKRI LVITDVLAAP VNTPRGAATG 
AEGDGARETR GGATGTAPSV AGLAAGLIDA ETERRAQQAE SQREVLFADE LLPGVGGEPL
TLRQGLAAGG SLTFLTLVVL AALDELESAA LTVLAPDIRD AFGISDSAIV FISAAAGAFL
VLGALPMGWL ADHLRRSRLI GWAGVAFSVM VLASGLAANA FLFFLARFGV GVAKSSNTVQ
ASLLADAYPI GVRGRISATT YGAARTAGAI SPLLVAGIAT WVGGDDGWRW PFLILGLPAL
AIAVVAFRLP EPPRGQHEMR SVLGEVVEDA KPMPISVEAA FARLMRIRSV KTSIIAFSSL
GFSLFTTGIL ANLWAEDHYG MSTFQRGLMG SLGGAALLVA LPIVGPRYDR LYRQDPARAV
ALLGFCILPG AVLLPVQWFM PHWIGFMLFS IPGAVFASVA FAMVGPVMQS VVPYRLRGLG
IALAATYMFF VGATGGAILS GLISDAYDPR VAVLVIGIPS TLVGGLMMIR NSSFVRHDLS
LVVADLHEEL AERDRQRDDP ENVPVLQVNN IDFSYGQVQV LFDVAFDVRK GETLALLGTN
GAGKSTILKV ICGLGTPSRG VVRLNGRTIT YVAPEQRGRY GVHLLPGGKG VFPAMTVREN
LEMAAFRMRR DRAGRDHRFA YVLDLFEDLK DRQSQRAGSL SGGQQQMLAL AMVLLHDPEV
LLIDELSLGL APVVVADLLA ILERLKADGL TIIVVEQSLN IALAIADRAV FLEKGQVRFT
GPARELAERD DLARAVFLGR EGG
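
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons. Alternative start codons are not modeled, which is an assumption that holds here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons, in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide string, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence on this page.
print(translate("ATGCGACCAG GAGCGACTCA TCAATGCCCA"))  # MRPGATHQCP
```

The output matches the first block of the protein sequence listed above.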