Gene Lcho_3644 details

Gene Information

Locus tag: Lcho_3644
Symbol:
ID: 6162681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 4069991
End bp: 4071028
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 70%
IMG OID: 641666417
Product: biotin synthase
Protein accession: YP_001792663
Protein GI: 171060314
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
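
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Lcho_3644.
length = gene_length_bp(4069991, 4071028)
print(length)                     # 1038
print(protein_length_aa(length))  # 345
```

Under these assumptions the computed values agree with the listed gene length (1038 bp) and protein length (345 aa).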


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGG CCACCATCTC CCTGTCCACC CTGCAAGCGT CGCGCCCCAG CGTGGCGGCG 
CGCGCCGATG CGGCGGCGCG CTGGCGCGTG GCCGACGTCG AGGCGCTCTA CGCGCTGCCC
TTCATGGACC TGCTGTTCCA GGCCCAGCAG GTGCACCGCG CCCACTTCGA CGCCAACGAG
GTGCAGCTGT CGACGCTGCT GTCGATCAAG ACCGGCGGCT GCGCCGAAGA CTGCGGCTAC
TGCCCGCAGT CGGCCCACTT CGACACCGCC GTCGAGGCCA GCAAGCTGAT GCCGATCGAC
GAGGTGCTCG ATGCCGCCAA CGCCGCCAAG GCGCAGGGCG CGACCCGCTT CTGCATGGGT
GCCGCCTGGC GCAGCCCGAA GGAGCGCGAC ATGGAACGCG TGACCGAGAT GGTGCGCGAG
GTGCGTGCGC TGGGCCTGGA GACCTGCATG ACGCTGGGCA TGCTCGACGG CGAACAGGCG
CGTGAACTCA AGGACGCCGG CCTCGACTAC TACAACCACA ACCTCGACAG CGCGCCCGAT
TTCTACGGCC AGGTCATCAG CACCCGCACC TATCAGGACC GCCTCGACAC GCTCGGCAAC
GTGCGCGACG CCGGCATCAA CGTCTGCTGC GGCGGCATCG TCGGCATGGG TGAAAGCCGC
ACCCAGCGCG CCGGGCTGAT CGCGCAGCTG GCGAACCTGT CGCCGTATCC GGAGTCGGTG
CCGATCAACA ACCTGGTGCC GGTGCCGGGC ACGCCGCTGG CCGATGCCGA GCCGATCGAC
CCGTTCGAGT TCGTGCGCAC GATCGCGGTG GCGCGCATCA CGATGCCGAC CACGATGGTG
CGGCTGTCGG CCGGGCGCGA GCAGATGGAC GAAGCGCTGC AGGCGCTGTG CTTCGCCGCC
GGCGCCAACT CGATCTTCTA CGGCGACAAG CTGCTGACCA CGAGCAACCC GCAGGCCGCC
CGCGACCGCG CGCTCTTCGA GCGCCTGGGC CTGCGCGTGC AGGGCGAGCG CCCGGCCGTG
CGTACATCGG ACAACTGA
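
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before computing its length or GC content. A minimal sketch, assuming plain-text input in this layout (the helper names are hypothetical), shown here on the first block only:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_block = "ATGACGACGG"  # first 10 bases of the gene sequence
print(len(clean_sequence(first_block)))  # 10
print(gc_fraction(first_block))          # 0.6
```

Passing the full sequence text to `gc_fraction` in the same way gives a value that can be compared with the GC content reported in the gene information.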
 
Protein sequence
MTTATISLST LQASRPSVAA RADAAARWRV ADVEALYALP FMDLLFQAQQ VHRAHFDANE 
VQLSTLLSIK TGGCAEDCGY CPQSAHFDTA VEASKLMPID EVLDAANAAK AQGATRFCMG
AAWRSPKERD MERVTEMVRE VRALGLETCM TLGMLDGEQA RELKDAGLDY YNHNLDSAPD
FYGQVISTRT YQDRLDTLGN VRDAGINVCC GGIVGMGESR TQRAGLIAQL ANLSPYPESV
PINNLVPVPG TPLADAEPID PFEFVRTIAV ARITMPTTMV RLSAGREQMD EALQALCFAA
GANSIFYGDK LLTTSNPQAA RDRALFERLG LRVQGERPAV RTSDN
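
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a hand-rolled illustration rather than the tool the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is already in the coding orientation. The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and last two blocks of the gene sequence.
print(translate("ATGACGACGG CCACCATCTC"))  # MTTATI
print(translate("CGTACATCGG ACAACTGA"))    # RTSDN
```

The two fragments match the start (MTTATI) and end (RTSDN) of the listed protein sequence. Trailing bases that do not fill a codon are ignored.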