Gene Noc_2101 details


Gene Information

Locus tag: Noc_2101
Symbol:
ID: 3704411
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2414935
End bp: 2415972
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 54%
IMG OID: 637738576
Product: biotin synthase
Protein accession: YP_344091
Protein GI: 77165566
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
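
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are inclusive and that the CDS ends in a single stop codon; the function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues in a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


print(gene_length(2414935, 2415972))  # 1038
print(protein_length(1038))           # 345
```

Both results agree with the Gene Length and Protein Length fields of this record.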


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTACT TTCCTCCTGC AGCTGAAATA TGTGACAGTC CGCGCCACGA TTGGTCTATC 
CCAGAGGTGC TGGCTTTGTT TGAATTACCG TTTGTAGAAC TTATTTATCG GGCGCAGACG
GTACATCGCC AGCATTTTAA TCCTAATCAA GTTCAGATGA GCACTTTGCT CAGTATTAAA
ACGGGGGGAT GTCCCGAAGA TTGTGCCTAT TGTCCCCAAA GTGTTCGCTA TAGCACGCCC
GTGAAAGCCG AACCTTTACT GCCCCTGGAG GAAGTATTGA CGGCGGCACG GAATGCCAAG
GCCCGGGGTG CAAGCCGTTT TTGTATGGGA GCGGCATGGC GCAGGCTCAA GGAGCGGGAG
CTGGAACCGG TAGCGAAGAT GATTACAGAG GTGAAAGCCC TGGGGTTAGA AACATGCGTG
ACATTAGGCA TGTTAGGTCC AGGACAAGCG GAACGGCTTA AGGCTGCGGG ACTAGATTAT
TACAACCATA ATCTGGATAC CTCACCGGAG TTTTACGGCG AGATCATTAC CACCCGTACC
TATCAGGATC GGCTGGAGAC CTTGTCTCAA GTCCGGGAAG CGGGCATTCA TGTGTGTTGT
GGCGGTATTG TGGGGATGGG CGAGGAGCGT TCTGATCGGG CGGGTTTGTT GGCCAACTTG
GCTAATCTGC CCCGTCACCC GGAGAGCGTT CCAATTAATA GGCTGGTCCA GGTAGAAGGT
ACCCCCTTGG CCGGGGCTCC CGAGCTAGAC CCCTTTGAGT TTGTGCGTAC CGTGGCCTGC
GCTCGAATCC TGATGCCCGC CTCCTTCGTG CGCCTTTCAG CAGGCCGAGA GACAATGAGC
GATGAATTGC AAGCTCTTTG TTTTCTTGCT GGAGCCAATT CCATTTTTTA TGGTGAAAAG
CTGCTCACGA CCCCCAATCC AACCACAGAT CACGACCAGC AATTGTTTGA GCGTTTGGGT
CTTGAGCTTT TGTTTCCCCA GGCACAGGTT GCCGCTCCCG TGCCGGAGGC TGATGAAGTG
GGATCGGCCT CTGGCTGA
 
Protein sequence
MTYFPPAAEI CDSPRHDWSI PEVLALFELP FVELIYRAQT VHRQHFNPNQ VQMSTLLSIK 
TGGCPEDCAY CPQSVRYSTP VKAEPLLPLE EVLTAARNAK ARGASRFCMG AAWRRLKERE
LEPVAKMITE VKALGLETCV TLGMLGPGQA ERLKAAGLDY YNHNLDTSPE FYGEIITTRT
YQDRLETLSQ VREAGIHVCC GGIVGMGEER SDRAGLLANL ANLPRHPESV PINRLVQVEG
TPLAGAPELD PFEFVRTVAC ARILMPASFV RLSAGRETMS DELQALCFLA GANSIFYGEK
LLTTPNPTTD HDQQLFERLG LELLFPQAQV AAPVPEADEV GSASG
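
The protein sequence should follow from the gene sequence under translation table 11. Below is a minimal Python sketch of that check, assuming the space-separated 10-base blocks are stripped before use. The helper names `clean`, `gc_content`, and `translate` are hypothetical. Table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used here.

```python
from itertools import product

# Standard codon table, built in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
print(translate("ATGACCTACT TTCCTCCTGC AGCTGAAATA"))  # MTYFPPAAEI
print(translate("GGATCGGCCT CTGGCTGA"))               # GSASG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives an end-to-end consistency check, and `gc_content` on the same input can be compared with the GC content field.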