Gene Franean1_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3921 
Symbol 
ID5672282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4688805 
End bp4689704 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content69% 
IMG OID641242800 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001508217 
Protein GI158315709 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0195881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCC CCGACGTGGT GGACCGCGAG GTCCACTTAC CGCCGGCCCC GGCGCTGGAC 
ACGCCGATCG TCCCGGCGGT GCGAAAGTCC CGCTCGATCC TGGTGTACCT GGCCTACGCC
TGGCTGATCG CGGTGATCGC GCTCGCCGCG CTGGCGGACG TGCTGCCGCT GGCGTCGTAC
TCCATTCCCA TCGGGAAGCC GCGCCAGGGA CCGGATTTCA GCTCGTTCGA CCTGTGGCTG
GGCACCGACC AGCAGGGCCG GTCGATCCTG TCGCGATGCG TGTACGGTGC GCGCGTCTCG
CTCCTGGTCG GAACCGTGGC GGGCCTCATC GGGGCCGTCA TCGGAACCCT GCTGGGCATG
CTCGCCGGCT ACCTCGGCAA GGCCGTCGAC TGGATCATCC GGCTGATCAC CGACGCGATG
CTGGCGTTCC CGCCACTGAT CCTGCTCCTG GCGCTGTCGT CCATCCTCAC GCCGAGCGTG
CGGACGCTGC TGGTCGGCCT GACGCTGCTG ATCATCCCGA CGTTCGTCCG GCTCGCGCTG
GCGAACACCC TCGCCTGGTC GTCCCGCGAG TTCGTCACCG CCGCCCGCAA CATGGGCGCG
GGGCACGTGC GGATCCTGGT GAAGGAGATC CTGCCGAACC TGCTGCCACC ACTGGGCGCG
TTCCTGCCGG TCGTGATGGC CGCGCTGATC GTGGCCGAAG GGTCGCTGAG CTTCCTGGGG
ATGGGCATCC CGCCGCCCCA GCCCAGCTGG GGCGGCATGA TCTCCGACGG CAAGGAGGCC
ATCGCCGACT CCCCGCACAT GGTGCTGGTG CCGGCGATCG TCATCTTCTT CACCGTCTTC
GCGCTGAACC AGGCGGGCGA CCACCTGCGC AGCCGCTTCG ACCGCACGAT GCGCGACTGA
 
Protein sequence
MTTPDVVDRE VHLPPAPALD TPIVPAVRKS RSILVYLAYA WLIAVIALAA LADVLPLASY 
SIPIGKPRQG PDFSSFDLWL GTDQQGRSIL SRCVYGARVS LLVGTVAGLI GAVIGTLLGM
LAGYLGKAVD WIIRLITDAM LAFPPLILLL ALSSILTPSV RTLLVGLTLL IIPTFVRLAL
ANTLAWSSRE FVTAARNMGA GHVRILVKEI LPNLLPPLGA FLPVVMAALI VAEGSLSFLG
MGIPPPQPSW GGMISDGKEA IADSPHMVLV PAIVIFFTVF ALNQAGDHLR SRFDRTMRD