Gene Franean1_3554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3554 
Symbol 
ID5671923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4216322 
End bp4218112 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content74% 
IMG OID641242440 
ProductABC transporter related 
Protein accessionYP_001507860 
Protein GI158315352 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.32539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTCGGC GGCTGTTCAA CCTGCTCGGC CCGGAGCACC GGGGCGAGCT GCGGAGCCTG 
CTCGGCTGGC TCGTCGGCGC GGCCGTCCTG CAGGGCGTCG CCTTCGTGCT GCTGGTGCCG
ATCCTGCGCG CGCTGCTCGG CGACGACCCG AATGAGGTCT GGCCCTGGAT CTGGGCGCTC
GCGGCGACGA CGGCGGGCTA CGGGTTCGCG CATCGGGTGG GCCTGTTGGC CAGCGCCGAC
GCGGGCGCCG GCCTCTCACG CACCGTGCAC CACCGCATCG GCGACCGGGT CGCGGGCCTC
CCGCTCGGCT GGTTCTCGGC CGACCGGACG GCGGCGGTCA ACCAGCTCGC GACGCGGGAC
GTGATGGACG TCATGGGCGT GTTCGCGCAC CTGCTGCGGC CACTGCTGAC CGGTTTCGTG
ACCCCCGGTG TCGTCGTCGC GCTGATGTAC GTCTTCGACT GGCGGCTCGC GCTCGCCGCC
ACGCTCACCG TCCCGATGCT CGCACTGGTC TACCGGTGGA GCACCCGGCT CGCCCACCAG
GCCGACGCCG CCAGCGACGC CGCCCAGGTG GACGCCGGCG GGCGGGTGCT CGAGTTCGCC
CGGGCCCAGC GGGTGCTACG GGTGTTCGGC CGCGGGGCGA CCGGGTCCGC CGCCCGGGCG
CTCGACGAGG CCCTGGAGCG GCAGCGGGCG GCCGGGCACC GGCAGTTGCG GGTGGCGGTG
CCGGGGCTGA TCGGTTTCGC GGTCGCGGTC CAGGCCGCGT TCACCGTGGT GATCATCCTC
GGCACCTACC TCGCGCTCGA CGGCTCGCTG GACGCGCCCA CGCTGCTCGC GGTCCTCGCC
CTGGCGGCCA GGTTCATCGA GCCCGTCGCG GAGGCCGCGG CGCTGGGCAC CGAGCTGCAG
AAGGCCAGCC GTGCCCTCGA GCGCATCGAG AACCTGCTCG CCCTGCCGAC GCTGCCGCAA
CCGGCGGCCC CCCGGCACCC CGAGGGCACC GGCATCGAAC TCGACAACGT CGTGTTCGGG
TACGAGGGCC GCCGCGTGCT CTCCGGGGTG TCCGCCCGGC TGCCCGAGGG CACGATGACC
GCGCTGGTGG GCCCATCGGG CTCCGGCAAG ACCACGATCA CCCGGCTCAT CGCACGTTTC
TGGGACACGG ACTCGGGAGC CGTGCGGATC GGCGGGGTCG ACGTCCGGGA GATCGATCCC
GACGAGCTGA TGTCGCTGAT CTCCGTGGTC TTCCAGGACG TCTACCTGTT CGAGGGCAGC
ATCGAGGACA ACATCCGGGT CGGGCGTCCC GGGGCGAGCG CCGAGCAGGT GCGCGAGGCC
GCGCGGCTGG CCCGGGTCGA CGAGATCGTC GCGCGGCTGC CCGAGGGCTG GGACACCCGC
GTCGGGGAGG GCGGCAGTGC CCTGTCCGGC GGCGAGCGGC AGCGGGTCTC CATCGCCCGC
GCGATCCTCA AGGACGCCCC GATCGTGCTG CTCGACGAGG CGACGTCCGC GCTGGACCCG
GAGAACGAGC AGGCCGTCCA GGAAGCTCTG GCCGCGCTGG CGAGCAACCG CACGCTGCTC
GTGATCGCGC ATCGGCTGGG GACGGTGATG GCGGCCGACC AGATCCTCGT CCTGGAACGC
GGGCTTGTCG TCGAGTCCGG AACCCATGCC CAGCTCATCG GGACCGGCGG CCGGTACGCG
GCCTTCTGGC GCCGCCGCCG CGGCGCCGAG GGCTGGCGTC TGGCGCCCGG CGCCGTTCCC
ACCGGTGAGT CCGCGCTCGC CCGTCTTCCC CACCCGGCCG CGGGCCGCTA G
 
Protein sequence
MIRRLFNLLG PEHRGELRSL LGWLVGAAVL QGVAFVLLVP ILRALLGDDP NEVWPWIWAL 
AATTAGYGFA HRVGLLASAD AGAGLSRTVH HRIGDRVAGL PLGWFSADRT AAVNQLATRD
VMDVMGVFAH LLRPLLTGFV TPGVVVALMY VFDWRLALAA TLTVPMLALV YRWSTRLAHQ
ADAASDAAQV DAGGRVLEFA RAQRVLRVFG RGATGSAARA LDEALERQRA AGHRQLRVAV
PGLIGFAVAV QAAFTVVIIL GTYLALDGSL DAPTLLAVLA LAARFIEPVA EAAALGTELQ
KASRALERIE NLLALPTLPQ PAAPRHPEGT GIELDNVVFG YEGRRVLSGV SARLPEGTMT
ALVGPSGSGK TTITRLIARF WDTDSGAVRI GGVDVREIDP DELMSLISVV FQDVYLFEGS
IEDNIRVGRP GASAEQVREA ARLARVDEIV ARLPEGWDTR VGEGGSALSG GERQRVSIAR
AILKDAPIVL LDEATSALDP ENEQAVQEAL AALASNRTLL VIAHRLGTVM AADQILVLER
GLVVESGTHA QLIGTGGRYA AFWRRRRGAE GWRLAPGAVP TGESALARLP HPAAGR