Gene Franean1_3675 details

Gene Information

Locus tag: Franean1_3675
Symbol:
ID: 5672041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4350277
End bp: 4351503
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 68%
IMG OID: 641242558
Product: hypothetical protein
Protein accession: YP_001507978
Protein GI: 158315470
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
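
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4350277
end_bp = 4351503
gene_length_bp = 1227
protein_length_aa = 408

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1
codons = span // 3

assert span == gene_length_bp      # 1227 bp
assert span % 3 == 0               # whole number of codons
# Assumption: the final codon is a stop codon and is not translated.
assert codons - 1 == protein_length_aa

print(span, codons)  # 1227 409
```

Under these assumptions the three fields agree: 1227 bp gives 409 codons, and dropping the stop codon leaves 408 amino acids.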


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.389336
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACAA CTCGTTCACT GTCGTGGTTG TCCGCCGTGC TGTTATGCAC CGCCGCCCTG 
ACCGGCTGTC AGGGCGGCGC GACGCAGAGC GCGATGTCCT GTACCACCGA GGGTGTCACC
GCGGACGAGG TCAAGCTGGG GCTGTTGTTT CCGGACACCG GGCTCGGTGC GGTGACGTTC
AGTGCCGCCC GGGCCGGAAT CGACGCCCGA TTCGGAGCGG TGAACGCGGC CGGCGGTGTG
CACGGGCGCC AGATCGTCTA TGACTGGCGA GACGACACGG CCAAGGTGTC GATGAATCTC
ACGATGGCCC GCACACTGAC GGAGCGGGAA AGCATCTTCG GCATGCTCGA GACGAGCACG
GTGGCCTCGG GTAGCGCCGC CTATCTCGCG GAGCGCGGGA TTCCGGTGCT CGGGATCGCC
GTGGAGGACG CGTGGGCGAA GTACCGGAAC ATGTTCTCCT TCAACTACAG TTTCACGGCG
AAGGGCTCGG CCGACACGTT CGGCAAGTTC GTCCATGAGC GCGGCGGGAC CAAGGCGATG
ATCCTGTACA ACCCGCTCGA CCCGACCGTG TCGACGCACA TCGCCGAACA GTTCACCTCC
AGCTTCCGGT CGGTGGGAAT CACGACGACC TCGGTCGGCA CCGACGACAA CCCCGTCTCG
GCCCAGGCGG ACCAGCTCGC GCGGCAGATG GCCGAGGAGG GCATCGACAC GCTGGCCGGC
ACGCTGAGCA CGGAGGGCCT GGCCAAGGTC GTCGCGGCCG CGCGGCGCCA GGGCGTCCCG
CTCAAGGTCA TCCTCAGCAG CAGCCCCGCG CCGAACGCCG AGCTGTTGGA GACCTACGGC
TCGCAGCTGG CCGGTCTGAC GACGTTCGCC GCCTACATAC CGCTGGAGAC GAAGTCGCCC
GCGCTCGACG CCTACCGCGC CGCGATGGCC ACCTACGCCC CGCAGCTGCA GGACACCGAC
CAGACGCTGG CGCTGGTCGG CTACATCATC GCCGACATGT TCATCCGGGG CCTGGAGGAG
GCGGGGGACT GCCCGACCCG GCAGGGCTAC ATCGACGGCC TCCGGGCGGT GAAGGGCTAC
AACGCCGGTG GCCTCATCGG CGACATCGAC CTCGAACGTG ACTTCGGCAA GCCCGCCGAG
TGCTACTCGT TCGTCGAGGT GAACCCCGAA GGCTCCGCCA TCGAGATCGT CAGCCCGAAC
TACTGCGGAC ATCGGCTCGC CGACTGA
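
A GC content figure like the one listed in the record can be estimated from a nucleotide sequence such as the one above. This is a minimal sketch under stated assumptions: the record does not say how the database computes or rounds its value, and `gc_percent` is a hypothetical helper name. The function ignores the spaces that separate the 10-base blocks.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Keep only nucleotide letters so block separators and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as a small example.
first_blocks = "ATGGCGACAA CTCGTTCACT"
print(gc_percent(first_blocks))  # 50.0
```

Passing the full gene sequence as a single string would give the gene-wide figure to compare against the record.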
 
Protein sequence
MATTRSLSWL SAVLLCTAAL TGCQGGATQS AMSCTTEGVT ADEVKLGLLF PDTGLGAVTF 
SAARAGIDAR FGAVNAAGGV HGRQIVYDWR DDTAKVSMNL TMARTLTERE SIFGMLETST
VASGSAAYLA ERGIPVLGIA VEDAWAKYRN MFSFNYSFTA KGSADTFGKF VHERGGTKAM
ILYNPLDPTV STHIAEQFTS SFRSVGITTT SVGTDDNPVS AQADQLARQM AEEGIDTLAG
TLSTEGLAKV VAAARRQGVP LKVILSSSPA PNAELLETYG SQLAGLTTFA AYIPLETKSP
ALDAYRAAMA TYAPQLQDTD QTLALVGYII ADMFIRGLEE AGDCPTRQGY IDGLRAVKGY
NAGGLIGDID LERDFGKPAE CYSFVEVNPE GSAIEIVSPN YCGHRLAD