Gene Franean1_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3271 
Symbol 
ID5671645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3875115 
End bp3876323 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content64% 
IMG OID641242163 
Productputative Leu/Ile/Val-binding lipoprotein transmembrane 
Protein accessionYP_001507583 
Protein GI158315075 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTAG GGCGACGGAA AAGGCTGGTG GCCACGACGG CCGCGATTCT TGCGGCTACC 
GCGACCAGTT GTTCGAACTC CGGTTCCGGA AGCGATCCTA TTGTCGCCTG CGAGAGCCCT
GGTGTCACCT CTGACCAGGT AAAGTTCGGC CTGGTCTTCT CCGACTCGGG AGCCGGAAAC
CAGACACTGT CCTCGGCTCG TGCCGGGGTC GACGCCAGGA TCGGCCTGGC GAACCAGGAA
GGTGGAGTCA ACGGCCGTCG CCTCGTCTAC GAATGGCGCG ACGACGCGGC CTCCCCATCG
CAGAACGCCA AGGTGGTCGA CGATCTCGTC AACCAGGAAT CGGTGTTCGG GGTCGTCGCT
GTCACCACGT CGACCAGCGG TTCTATCGAG AACCTGGGAT CGGCGGGCAT CCCGGTCGTG
GGTCTCGCCG ACGCCACCTG GAAGACCCAC CCGAACATGT TCTCGAATTC CTACGAGACG
TCACCGCAGA GCGTAGGCCA ATATCTCCAG GCCAACGGCG GAACAAAGGT CGCTTTTGTC
ACCACAGGCT CGTCAGCCTA CACAGTGGGC TATGCCGAAC AGTATGCGTC GGCCATGCGG
GCGATGGGGC TGACCGTGGT GGGAACGGCG TCGTACTCAA GCGGCGACAG CCCGGTTCGG
GTGGCCCAAC AGCTGGCCGA CTCCGGAGCC AACGTCATCG TGGGCCTCAC CACCCCGAAC
GACATCGCCA GCATCATGCA TGCGGCTCGC ACCATAAACG CCAGTTTCGC CGCCACCGTC
TCGCTCGCCG GGTACGACCG CGGTGTGCTG AACACGCTGG GGACGGACCT GGCTGGCGTC
TCGTTCCCGG TGTACTTCCG CCCCTTCGAG GCCGGTGGAC CGGCCATCGA CCACTATCGA
AACGCGATCA CACAGTTCGC TCCGGAACTG GTCATGCCCG AGCAGCAGTT CGCGATGTAC
GGGTACATCT ACGCGGACCT GTTCATCCGG GGGCTCCAGG AGGCCGGCGC CTGCCCCACC
CGGGAGAACT TCATCAGCGG GCTACGCCCC GTGACCGGCT ACAACGCGGG TGGTCTGATC
GAACCGGTCG ACCTCGCCAC CAACATCAAC AAGCCGCTTG ACTGCAGCGC ATTCGTCCAG
GTCGACCCAA CGGGCCGCAC CTTCCAGGTC ACCCAGGAAC GCCTCTGCGC CGACGGTACG
GGAAGCTAA
 
Protein sequence
MRLGRRKRLV ATTAAILAAT ATSCSNSGSG SDPIVACESP GVTSDQVKFG LVFSDSGAGN 
QTLSSARAGV DARIGLANQE GGVNGRRLVY EWRDDAASPS QNAKVVDDLV NQESVFGVVA
VTTSTSGSIE NLGSAGIPVV GLADATWKTH PNMFSNSYET SPQSVGQYLQ ANGGTKVAFV
TTGSSAYTVG YAEQYASAMR AMGLTVVGTA SYSSGDSPVR VAQQLADSGA NVIVGLTTPN
DIASIMHAAR TINASFAATV SLAGYDRGVL NTLGTDLAGV SFPVYFRPFE AGGPAIDHYR
NAITQFAPEL VMPEQQFAMY GYIYADLFIR GLQEAGACPT RENFISGLRP VTGYNAGGLI
EPVDLATNIN KPLDCSAFVQ VDPTGRTFQV TQERLCADGT GS