Gene Franean1_3151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3151 
Symbol 
ID5671528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3706563 
End bp3707864 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content60% 
IMG OID641242046 
Producthypothetical protein 
Protein accessionYP_001507466 
Protein GI158314958 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.134749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAT GGCGCTGCAG GCAGTCCCAG ATAGTAGTCG GCACCGCCAC AGCCCTCATT 
CTTGTGGCGG GCTGCAGTGT CGAGGCGAAC GACGAGTCGA CGAATCCTTC GACAGGCACG
CCAGCGGTAG CACCCGCGCC GGACATCTCA CCGGGTGTCA CAGCCAATAC CATCAAGATC
GGCTTCGTGT ACCCCAACCT CTCCGCCGTC AAGAAGTTCA TAAACATCGA TCACGGCGAC
TACGAGGCAA CCTTCACGGC ATTGGTCGAC AAGGTTAACT CCTCCGGCGG CATCAACGGC
CGGAAGCTCC AGCCCGTCTT CGGCGCGGTT GACGTCACCT CGCCTGCTGG CGCTCAGGAG
ACCTGTCTCA AACTGACCCA GGACGAGAAG GTTTTCGCTG TCCTCGGCAG TCTCAGCGGC
GACGAGCCGC TCTGCTACAT CCAGACGCAT AGGACGGCGC TCGTCGGCGG CACGCTCTCG
CCGGATCGCT ATGCCAAGGC CCAGGCACCC TGGTTCTCAT ATCAGCGAGG CGGCGACGAG
GCTGCGGAGG GCATCAAGCT CTTTGCCGCG GACGGCGGCT TGGACGGGAA AGTCGCGATC
GTCTCCTCTC TCAACGAGGA GGGTGTTATG AAGACGGCCA TCATGCCAGC TCTGAAGGAG
CTCGGTATCA CACCGGTGGC AGCCGGGGTG CTCGATGCAC CGGCCACGGA TCCGGCAGCC
GTTGCTCAGC AACTCAACGT CTTTGTGCAG AAGTTCCAGT CCGCTGGCGC CGACACCGTG
ATCGTTGTCG GTGGGGTCTC GAGCGAGTTT CCCAAAAGCC TGGAGAAAAC CGACTATCGC
CCCAAACTTC TCTTCAGCGA AATCAACCAG GCCGAACTCT ATTCGAACGA TCCCGGCGAG
CATGATTTCA GCACACTGAA AGATGCGGCC GCCGTCGGCC TTGGTGTCAA CTGGAACGAC
CCAGCGAACC TAGAATGTGT CAATACGCTC GTGGCAGCTC ATCCTGATCT GAAGGAGACA
CTCATCGATC CGAACGACGT GGAGTCCGGA GAGCCTCAAC TGGGAGTTTC CGCGGGTATC
GCCTGCAGCT CCCTCGCGCT ATTCACCGCT ATCGCAGAGA AGGCAGGCGG GACCCTTAGC
TACAAGACAT TCCAGGATGC CGCTTTCTCC TTGGGTTCCT TCCATGTTCC TGGCTTCATG
GACGATGCCA CATATGGCCC CTCCACACCC GATGGTCGGA TCCCGCCCCG CCTGTTCGAG
TACAGCGCCA CAGAGAAGAA CTTCAAGATG TCCACAGGTT GA
 
Protein sequence
MRKWRCRQSQ IVVGTATALI LVAGCSVEAN DESTNPSTGT PAVAPAPDIS PGVTANTIKI 
GFVYPNLSAV KKFINIDHGD YEATFTALVD KVNSSGGING RKLQPVFGAV DVTSPAGAQE
TCLKLTQDEK VFAVLGSLSG DEPLCYIQTH RTALVGGTLS PDRYAKAQAP WFSYQRGGDE
AAEGIKLFAA DGGLDGKVAI VSSLNEEGVM KTAIMPALKE LGITPVAAGV LDAPATDPAA
VAQQLNVFVQ KFQSAGADTV IVVGGVSSEF PKSLEKTDYR PKLLFSEINQ AELYSNDPGE
HDFSTLKDAA AVGLGVNWND PANLECVNTL VAAHPDLKET LIDPNDVESG EPQLGVSAGI
ACSSLALFTA IAEKAGGTLS YKTFQDAAFS LGSFHVPGFM DDATYGPSTP DGRIPPRLFE
YSATEKNFKM STG