Gene Franean1_3189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3189 
Symbol 
ID5671565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3759609 
End bp3760916 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content69% 
IMG OID641242083 
Producthypothetical protein 
Protein accessionYP_001507503 
Protein GI158314995 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCGC ACCATCGCTG GCGCCGTCCG CTCATGCTGA TCGCCGCGGG AGCCGCACTC 
GTCTTAGCCG GTTGCAGCGT CGAGTCGGAC GACCCGCCGG CCAGCAGCCC CACCTCGGAC
CCGTTCCCGG CCCCGCCGGC GCCGGCGATC TCGCCGGGTG TCACGGCCGA CAGCATCAAG
ATCGGTTTCG TCTACCCCGA CCTCGAGGTC GTCAAGCAGT ACGTCGACAT CGACCACGGC
GACTACCAGG CCACCTTCCA AGCCCTGGTC GACAAGGTCA ACGCCACGGG CGGCATCAAC
AGCCGGAAGA TCATCCCGGT GTACGGCGCG GTCGACGTCA TCTCCCCCGC CGGCGCGCAG
GAGACCTGCG TCAAGCTGAC CCAGGACGAG AAGGTCTTCG CGGTGCTCGG CAGCCTCAAC
GCCGAGGACG CGCTGTGCTA CGTCCAGACC CACAAGACGG CGCTCGTCGG CGGTGACCTC
ACCACGCAGC GCTACGCCAA GGCGCAGGCA CCGTGGGTCT CCGACCTGCG CGGCGGCGAC
GAACTCGCCG ACGGCATCGA GCTGTTCACC GCCGACAACA CCCTGGCGGG CAAGAAGCTC
GCGGTCGTCG CCTACCGGGA CGACCAGGCG ACGCTGGACA AGGTGGTCCT GCCGGCCCTG
CAACGTCTCC AGGTCCCGGT GACCGAGACA GGCATCCTCG ACGCCGACAT CAGTGACGCG
GCCGCGGTCT CCCAGCAGTT CAACGTCTTC ATCCAGAAGT TCCGGGCCGC GGGCGTCGAC
ACCGTGCTCC TGGTCGGCGG CTCCCAGCTC CAGTTCCCCG CCGAACTCCA GAAGACCGAC
TACCGCCCCA GGCTGATGTT CGCGACCACC AGCCAGGCCG GGGCCTACCT GGCGAGCCCG
GGCGACCACG ACCCGGCCAT CATGGCCGGC GCCACCGCCC TCGGGCTCGT CGTCGACTTC
ACCGAGCCGG CCAACGCCGC GTGCATCGCC ACCCTTGAGG CCGCGCTGCC AGCACTCACC
GGCAAGCTGG TCGACCCGGC GACCGTGCCC TCAGGCCAGC CCACCCCCGG AACATCCGAA
AGCGCCGCCT GCCGCTACCT GACCCTGTTC CAGGCCATCG CCGAGAAGGC CGGCAAGGAC
CTCACCTACC AGTCCTTCCA GCAGGCCGCG TTCTCCCTCG GCTCCTTCCA GGTCCCCACC
TACCGGGACA AGGCCACCTA CAGCCGCGAG ACACCCCACG GCGCCGTCCC CCCGCGCCTG
TTCACCTTCG ATCCCGCGAA GAAGAACTTC TTCCCCGCCG CGGGCTGA
 
Protein sequence
MTSHHRWRRP LMLIAAGAAL VLAGCSVESD DPPASSPTSD PFPAPPAPAI SPGVTADSIK 
IGFVYPDLEV VKQYVDIDHG DYQATFQALV DKVNATGGIN SRKIIPVYGA VDVISPAGAQ
ETCVKLTQDE KVFAVLGSLN AEDALCYVQT HKTALVGGDL TTQRYAKAQA PWVSDLRGGD
ELADGIELFT ADNTLAGKKL AVVAYRDDQA TLDKVVLPAL QRLQVPVTET GILDADISDA
AAVSQQFNVF IQKFRAAGVD TVLLVGGSQL QFPAELQKTD YRPRLMFATT SQAGAYLASP
GDHDPAIMAG ATALGLVVDF TEPANAACIA TLEAALPALT GKLVDPATVP SGQPTPGTSE
SAACRYLTLF QAIAEKAGKD LTYQSFQQAA FSLGSFQVPT YRDKATYSRE TPHGAVPPRL
FTFDPAKKNF FPAAG