Gene Franean1_3169 details


Gene Information

Locus tag: Franean1_3169
Symbol:
ID: 5671546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3734741
End bp: 3735949
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 65%
IMG OID: 641242064
Product: putative Leu/Ile/Val-binding lipoprotein transmembrane
Protein accession: YP_001507484
Protein GI: 158314976
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
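
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3734741
END_BP = 3735949


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```

Both results agree with the Gene Length and Protein Length fields of this record.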


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTAC GTCGACGGAC AAGACTGGTG GCCGCGACAG CCGCGATTCT TGCGGCTACC 
ACGGCCAGTT GTTCGACATC CGACTCCAAA AGCGAGGCAG CTGCCGCCTG TGAGAGCCCT
GGCGTCACCT CTGACCAGGT GAAGTTCGGC CTGGTCTTCT CCGACTCGGG AGCCGGAAAC
CAGGCACTGT CCTCGGCTCG TGCCGGGGTC GACGCTAGGA TCGGCCTGGC GAACCAGGAA
GGTGGAGTCA ATGGCCGTCG CCTCGTCTAC GAATGGCGCG ACGACGCGGC CTCCCCATCG
CAGAACGCCA AGGTGGTCGA CGAGCTCGTC AACCAGGAAT CGGTGTTCGG AGTCGTCGCT
GTCACCACGT CGGCCAGCGG TTCGGTCGAG AGCCTGGGAT CGGCGGGCAT CCCGGTCGTG
GGGCTGGCCG ACGCCACCTG GAAGATCCAC CCGAACATGT TCTCGTACTC CTATGAGACG
TCACCGCAGA TCATCGGCCG ATATCTCCAG CTCAACGGCG GAACGAAGGT TGCTTTTGTC
ACCACCGGCT CGTCGGCCTC CGCTGTTCGC TACACAGAAC GGTATGCGTC GGCCATGCGG
GCGATGGGGC TGACCGTGGT CGGAACGGCA TCCTACTCAA GCGGCGACAG CCCGGTTCGG
GTGGCCCAAC AGCTGGCCGA CTCCGGAGCC AACGTCCTGG TGGGCCTCAC CACCCCGAAC
GACATCGCCA GCATCATGCA TGCGGCTCGC ACCATAAACG CCAGTTTCGC CGCCACCGTC
TCGCTCGCCG GGTACGACCG CGGTGTGCTG AGCACGCTGG GGACGGACCT GGCTGGCGTC
TCGTTCCCGG TGTACTTCCG CCCCTTCGAG GCCGGCGGAC CGGCCATCGA CCACTATCGA
AACGCGATCA CACAGTTCGC TCCGGAACTG GTCATGCCCG AGCAGCAGTT CGCGATGTAC
GGCTACATCT ACGCGGACCT GTTCATCCGG GGGCTCCAGA AGGCCGGCGC CTGCCCCACC
CGGGAGAGCT TCATCCGCGG ACTGCGCACG GTGACCGGCT ACGACGCCGG TGGTCTGATC
GAACCGGTCG ACCTCGCCAC CAACATCAAC AAGCCGCTTG ACTGCAGCGC ATTCGTCCAG
GTCGACCCAA CGGGCCGCAC CTTCCAGGTC ACCCAGGAAC GCCTCTGCGC CGACGGTACG
GGAAGCTAA
 
Protein sequence
MHLRRRTRLV AATAAILAAT TASCSTSDSK SEAAAACESP GVTSDQVKFG LVFSDSGAGN 
QALSSARAGV DARIGLANQE GGVNGRRLVY EWRDDAASPS QNAKVVDELV NQESVFGVVA
VTTSASGSVE SLGSAGIPVV GLADATWKIH PNMFSYSYET SPQIIGRYLQ LNGGTKVAFV
TTGSSASAVR YTERYASAMR AMGLTVVGTA SYSSGDSPVR VAQQLADSGA NVLVGLTTPN
DIASIMHAAR TINASFAATV SLAGYDRGVL STLGTDLAGV SFPVYFRPFE AGGPAIDHYR
NAITQFAPEL VMPEQQFAMY GYIYADLFIR GLQKAGACPT RESFIRGLRT VTGYDAGGLI
EPVDLATNIN KPLDCSAFVQ VDPTGRTFQV TQERLCADGT GS
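
The gene and protein sequences above can be cross-checked against each other, and against the GC content field, with a short script. The sketch below is one possible way to do this using only the Python standard library; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns codons to amino acids exactly as the standard code does and differs only in the permitted start codons. For brevity, only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        return 0.0
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record.
first_line = "ATGCATTTAC GTCGACGGAC AAGACTGGTG GCCGCGACAG CCGCGATTCT TGCGGCTACC"
last_line = "GGAAGCTAA"

print(translate(first_line))  # MHLRRRTRLVAATAAILAAT
print(translate(last_line))   # GS
```

The two printed strings match the first 20 and the last 2 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.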