Gene Franean1_5944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5944 
Symbol 
ID5674265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7245544 
End bp7247322 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content73% 
IMG OID641244792 
ProductABC transporter related 
Protein accessionYP_001510194 
Protein GI158317686 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.115226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGAGG TGCGTCTTCT CGCCGACGAG CCGAACGTCC GCGGCGCGGG GCGCGGGGCG 
CTGCGCCACC TGCGCCCGCA CCGGCGGATC CTGGTGCTGG CGATCATGGG CACGATCGCC
AGCACCGCCT CCCTCGTCGC GATCGCGCCG GTCGTCGGCC GCGGCGTCGA CGCGGTGCTC
GCGCACGACC GCACGGCGCT GTGGGTGTCG GTCGCCCTGC TCGTCGTGGT CGTGCTCGCC
CGCCTGCTCC TGCTGCGGTG GTCGGAGGTG GTGCTCGCCC GCGCGGGCGA ACGGATCGTG
CACGACCTGC GCGATCTCGT GGTGGAGCGG CTGGCGAGCG CCCCGCTGCG GTTCGTGGAG
GCGCATCGGA CCGGTGACCT GCTCCGCCGG GCCACCGGCG AGATCGCCGA CCTGTCGCTG
TTCATCCGGG AGCAGTTGCC CAACCTGCTC GGTCTCGCAC TCACCGTCGT GCTGACGACG
GCCGTCCTGA TCGTGTACTC GCCGTTGCTG TCCCTGGTGC TCGTGCTGCT GTTCCTCCCC
GCCGCGGTGG GGGTCATCCG GTGGTTCAAC GCCTCGGCCA AGGTGGTCTT CGGGCGGCAG
GCCGCGGCCG ACGCCGCGAT GACCGCGACC TTCACCGAGA CCCTGGCGGC CGGTGAGGCT
CTGGTGGTGG CCGGGCGGCC GGGCGAATGG GTGAACCGGT TCCGCCGCGA CAACGACGAG
CTGCTGCGGG CGTCGAACGC GACGATCGGT GCTCAGAACC GGCTCGAGCT GTTCAACCTC
CTCGAAGGGC TGGCGACCGC GGTCCTGCTG CTGCTGAGCG TGTGGCTGGC CCGGGCCGGC
CACCTCGGGG TCGGCACGGT CGTGGTGTTC GTGCTCGCGA CCCGCAACCT GTTCGACGGA
ATGATGGGGC TCTCCCGGCT GGTCGGCGAG CTGCAGACCG CCAAGGTGGG CCTGGCCCGG
CTGGCCGACC TGCTGGGCGC GACCGAAGCG GCCCAGACAG CCGCCGCGCG GGCGGTGCTG
CCGCCGCGGG GCGAGCTCGT CGCGAACGGC GTGCGCTTCG GATACGCGGA GGGCGACGAC
GTGCTGCGCG GCGTCTCGGT GCGGTTCCCG GCTGGTGACC GCGCCGGCCT GGTGGGCAGC
ACCGGCTCCG GGAAGACGAC GCTCGCGAAG CTGCTGTGCG GCCTTTACCA GCCGGACGCC
GGCCTGGTGA CGTTCGGCGG GGTCGACCTG CGGACGGTCC CGGCAGCCGA GATCCGCCGG
CGGATCGTCC TCGTCCCCCA GCAGGTCCAC ATCATCACCG GAACGCTGGC GGAGAACCTG
GCGCTGGCCC CGGGCGAGCC GGACCGGGCG GCGATGGAGC GGGCGGTGGA GGCCCTCGGC
CTCACCGAGT GGGTGGCCGG GTTGTCCGGC GGGCTGGACG CCTCCGTCGG CTCGCGCGGG
GAGCTGCTGT CCGCCGGTGA GCGGCAGCTC GTCGGCCTGG TGCGGGCCGC TCTGGTCGAC
CCGCCGGTGC TGCTGCTCGA CGAGGCCACC GCCGATCTGG ATCCCGCCGT CGCGCGGCGG
CTCGAGACCG CGGTGGAGCG CATCCGCCCC GGACGGACGC TGATCGTCAT CGCGCACCGG
CAGTCCACCA TCGACCGGCT GCCGCGGCGG GTGCGCCTTG CGTCCGGGCT GATTGTCGCC
GGTGATGTGG CCGGTGTTGT GTCCGGCGTG GTTGACGAGC CAGGTTTCTC TGTACCCGGT
CAGCCACAGA ACGGGCACGT CACAACCGAT CGGTCGTAA
 
Protein sequence
MAEVRLLADE PNVRGAGRGA LRHLRPHRRI LVLAIMGTIA STASLVAIAP VVGRGVDAVL 
AHDRTALWVS VALLVVVVLA RLLLLRWSEV VLARAGERIV HDLRDLVVER LASAPLRFVE
AHRTGDLLRR ATGEIADLSL FIREQLPNLL GLALTVVLTT AVLIVYSPLL SLVLVLLFLP
AAVGVIRWFN ASAKVVFGRQ AAADAAMTAT FTETLAAGEA LVVAGRPGEW VNRFRRDNDE
LLRASNATIG AQNRLELFNL LEGLATAVLL LLSVWLARAG HLGVGTVVVF VLATRNLFDG
MMGLSRLVGE LQTAKVGLAR LADLLGATEA AQTAAARAVL PPRGELVANG VRFGYAEGDD
VLRGVSVRFP AGDRAGLVGS TGSGKTTLAK LLCGLYQPDA GLVTFGGVDL RTVPAAEIRR
RIVLVPQQVH IITGTLAENL ALAPGEPDRA AMERAVEALG LTEWVAGLSG GLDASVGSRG
ELLSAGERQL VGLVRAALVD PPVLLLDEAT ADLDPAVARR LETAVERIRP GRTLIVIAHR
QSTIDRLPRR VRLASGLIVA GDVAGVVSGV VDEPGFSVPG QPQNGHVTTD RS