Gene Franean1_3191 details

Gene Information

Locus tag: Franean1_3191
Symbol:
ID: 5671567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3763859
End bp: 3766105
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 70%
IMG OID: 641242085
Product: ABC transporter related
Protein accession: YP_001507505
Protein GI: 158314997
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0410] ABC-type branched-chain amino acid transport systems, ATPase component
TIGRFAM ID:
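
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; neither assumption is stated in the record. The function name `check_cds_lengths` is hypothetical and used only for illustration.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and a protein length that
    does not count the stop codon (both are assumptions).
    """
    span = end_bp - start_bp + 1   # number of bases covered by the gene
    codons = span // 3             # whole codons in the span, including stop
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "span_is_whole_codons": span % 3 == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record above.
result = check_cds_lengths(3763859, 3766105, 2247, 748)
for name, ok in result.items():
    print(f"{name}: {ok}")
```

Under these assumptions the span is 2247 bp, which is 749 codons: 748 amino acids plus one stop codon.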


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACGC CCACCGAACC GGTGACCGAG GCACCAGAAG CGGCACCAGA AGCGGCGTCG 
GCGGGGGTCG CTGCCAGGCT GGCCGCTGGG CTGATCGAGG CGGAGGCGCA GCGGCGGGCG
CAACAGGCGG ACTCCCGGCG TGAGGTGTTG TTCGCGGACG AGCTGCTGCC CGGTGTGGGC
GGTGAGCGGC TGTCGTTGCG GCAGGGCCTG GCCGCCGGTG GTTCGGTGAC GTTCCTGACG
CTGATGACGC TGTCGGCGCT GGACGAGCTG GAGTCGGCGG CGCTGACGAC ACTGGCACCC
GACATCCGGG ACGCGTTCGG GCTGAGCGAC GGCGCGATCG TGTTCATCTC CGCGGCCGCC
GGGGCGTTCC TCGTGCTGGG CGCATTGCCG ATGGGCTGGC TGGCGGACAA ATGCCGGCGC
TCCCGGGTCA TCGGCTGGGC GGGTGTCGTG TTCTCGGTGA TGGTGCTGGC GAGCGGACTG
GCGGCGAACG CGTTTCTGTT CTTCCTGGCC CGCTTCGGTG TCGGGGTGGC GAAGTCGAGC
AACAACTCGG TACACGGATC GTTGCTCGCC GACACCTATC CGATCGGAGT CCGGGGGCGG
ATCTCCGCGG CGAACTTCGG GGCGGCGCGC ACCGCTGGAG CGCTCAGTCC CCTGGTGGTC
GCGGGCATCG CCACGCTGGC GGGTGGGGCT GACGGCTGGC GTTGGCCGTT CCTGATCCTC
GGCCTGCCGG CCCTGGCGGT GGCGGTTTTC GCGTTCCGCC TGCCGGAGCC GCCTCGCGGC
CAGCACGAGA TGAAATCGGT GCTGGGCGAG GTGGTGGAGG ACGCCGAGCC GATGCCGATC
TCGGTCGAGG CGGCATTCGC CAGGCTGCTG CGGATCCGGA CCGTCAAGAC CGCGATCGTC
GCATTCTCCG CGCTGGGTTT CAGCCTGTTC ACCACGAGCG TGCTGGCGAA TCTGTGGGCT
GAGGACCACT ACGGGATGTC GACGTTCCAG CGGGGCCTGA TGGGTTCACT CGGCGGCGCC
GCTCTGCTCG TGGCCCTGCC GATCGTCGGG CCACGCTACG ACCGGCTCTA CCACCGGGAT
CCGGCACGGG CCGTGGCTCT GCTGGGGCTG TGCATCCTGC CGGGTGCGGT GCTGCTGCCG
GTCCAGTGGT TCATGCCGAA CTGGGTCGGG TTCATGCTGG CCAGCATCCC GGGGGCGGTG
TTCTCCTCGG TCGCGTTCTC GATGGTGGGC CCGGTGATGC AGTCGGTGAC GCCCTACCGG
CTGCGCGGGC TGGGCCTGGC GCTGGCCGCG GTCTACATGT TCTTCATCGG CGCGACCGGC
GGCGCCATCC TCTCCGGGCT GATCAGCAAC GCCTACGACC CGCGGGTCGC GGTGCTGGTG
ATCGGGATCC CCGCGACCGC GGTCGGCGGG CTCATGATGA TCCGCAGCGC GTCCTTCGTG
AAGAACGACC TGTCGCTGGT CGTGGCGGAC CTGCGCGAGG AGCTCGCCGA ACGCGACCGG
CAGCGCGAGA ACCCGGACGA CGTCCCAGCG CTCCAGGTCA ACAACATCGA CTTCTCCTAC
GGGCCGGTAC AGGTCCTGTT CGACGTCGCC TTCGACGTGA AGAAGGGCGA GACCCTGGCG
CTGCTGGGCA CCAACGGCGC GGGCAAGTCC ACCATTCTGA AAGTCATCTG CGGCCTGGGA
ACCCCGTCGC GTGGGGTGGT CCGCCTCGGC GGGCGGACCA TCACCTACGT CTCCCCGGAA
CAGCGCGGGA AGTACGGCGT GCACCTGCTG CCCGGCGGCA AGGGCGTCTT CCCCGGGATG
ACGGTGCGCG AGAACCTGGA GATGGCCGCG TTCCGGATGC GCCGCGACGC GGCCGGCCGG
GAGCGGCGTT TCGCCTACGT ACTCGGCCTG TTCCCCGACC TCGAGAGCCG ACAGGGCCAG
CGGGCGGGGT CGCTGTCGGG CGGGCAGCAA CAGATGCTGG CGCTGGCGAT GGTGCTGCTG
CACGACCCCG AGGTGCTACT GATCGACGAG CTCTCCCTCG GGCTGGCCCC GGTGGTGGTG
CAGGACCTGC TGGCGATCCT CGAACGCCTC AAACAGGACG GCCTGACGAT CATCGTCGTC
GAACAGTCCC TGAACATCGC CCTCGCCATC GCCGACCGGG CGGTGTTCCT GGAGAAGGGC
CAGGTCCGTT TCACCGGGCC AGCCCGCGAA CTCGCCGAAC GCGACGACCT GGCACGCGCG
GTGTTCCTCG GCCGGGAAGG CGGCTGA
 
Protein sequence
MSTPTEPVTE APEAAPEAAS AGVAARLAAG LIEAEAQRRA QQADSRREVL FADELLPGVG 
GERLSLRQGL AAGGSVTFLT LMTLSALDEL ESAALTTLAP DIRDAFGLSD GAIVFISAAA
GAFLVLGALP MGWLADKCRR SRVIGWAGVV FSVMVLASGL AANAFLFFLA RFGVGVAKSS
NNSVHGSLLA DTYPIGVRGR ISAANFGAAR TAGALSPLVV AGIATLAGGA DGWRWPFLIL
GLPALAVAVF AFRLPEPPRG QHEMKSVLGE VVEDAEPMPI SVEAAFARLL RIRTVKTAIV
AFSALGFSLF TTSVLANLWA EDHYGMSTFQ RGLMGSLGGA ALLVALPIVG PRYDRLYHRD
PARAVALLGL CILPGAVLLP VQWFMPNWVG FMLASIPGAV FSSVAFSMVG PVMQSVTPYR
LRGLGLALAA VYMFFIGATG GAILSGLISN AYDPRVAVLV IGIPATAVGG LMMIRSASFV
KNDLSLVVAD LREELAERDR QRENPDDVPA LQVNNIDFSY GPVQVLFDVA FDVKKGETLA
LLGTNGAGKS TILKVICGLG TPSRGVVRLG GRTITYVSPE QRGKYGVHLL PGGKGVFPGM
TVRENLEMAA FRMRRDAAGR ERRFAYVLGL FPDLESRQGQ RAGSLSGGQQ QMLALAMVLL
HDPEVLLIDE LSLGLAPVVV QDLLAILERL KQDGLTIIVV EQSLNIALAI ADRAVFLEKG
QVRFTGPARE LAERDDLARA VFLGREGG