Gene SeSA_A2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2201 
SymbolcbiD 
ID6515435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp2089213 
End bp2090352 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content56% 
IMG OID642747272 
Productcobalt-precorrin-6A synthase 
Protein accessionYP_002115065 
Protein GI194734874 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1903] Cobalamin biosynthesis protein CbiD 
TIGRFAM ID[TIGR00312] cobalamin biosynthesis protein CbiD 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC TTTCTTTTGA CGCGCCGGTC TGGCACCACG GTAAAGCGTT ACGTAAAGGA 
TATACCACCG GTTCCTGCGC GACGGCGGCG GCAAAAGTCG CCGCGCTGAT GGTATTGCGT
CAACATCTGA TTCATCAGGT CTCCATCGTC ACGCCGTCGG GCGTCACGCT ATGCCTGAAC
GTGGAGTCGC CGCACATTGA AGGCCAGCAG GCGATAGCCG CAATTCGTAA AGATGGCGGC
GATGATGTGG ATGCCACGCA CGGAATGCTG ATTTTTGCCC GCGTTACGCT CAACGACAGC
GGTGAGATTA CGCTCACTGG CGGCGAAGGT ATTGGTACGG TAACGCGTAA AGGGGTTGGG
CTGCCGCTCG GGAGTGCCGC CATCAATCGT ACGCCGCGCC ATACCATTGA GTCAGCGGTG
CGCGAAGCGA TAGGCCCGGC GCGTGGGGCC GATGTGGAGA TTTTTGCCCC GGAAGGCGAA
GCGCGGGCGC AAAAAACGTA TAACTCGCGG CTTGGCATTC TTGGCGGCAT TTCCATTATT
GGCACTACCG GCATTGTGAC ACCGATGTCG GAAGAAAGCT GGAAACGCTC GCTATCGCTG
GAACTGGAGA TCAAACGGGC GTCAGGATTA ACGCGGGTGA TACTCGTGCC GGGCAACCAC
GGCGAACGGT TTGTTCGCGA ACAAATGGGC GTCGACACAC AGGCGGTCGT CACCATGAGC
AATTTTGTCG GCTACATGAT TGAAGAGGCG GTACGGCTGG GATTTTGCCA GATAGTGCTG
GTGGGGCATC CTGGAAAATT GATCAAAATC GCCGCCGGGA TCTTTCATAC CCATAGCCAT
ATTGCCGATG CGCGCATGGA AACGCTGGTC GCGCATTTAG CTTTACTGGG CGCGCCGCTG
GAGTTACTCA CCCTGGTCAG CGATTGCGAT ACCACTGAAG CGGCAATGGA GCACATTGAA
GCATATGGCT TCGGGCATAT CTATAACCAT CTTGCCAGGC GTATTTGTTT GCGAGTCATG
CAGATGCTGC GTTTTACCAA AACGCCGCCT GTCTGCGACG CCATCCTGTT TTCTTTTGAT
AACCATATTC TCGGCAGTAA TCGTCCCGTC GACGAGATTG CTAAGGAGCT GCAATGCTAA
 
Protein sequence
MSELSFDAPV WHHGKALRKG YTTGSCATAA AKVAALMVLR QHLIHQVSIV TPSGVTLCLN 
VESPHIEGQQ AIAAIRKDGG DDVDATHGML IFARVTLNDS GEITLTGGEG IGTVTRKGVG
LPLGSAAINR TPRHTIESAV REAIGPARGA DVEIFAPEGE ARAQKTYNSR LGILGGISII
GTTGIVTPMS EESWKRSLSL ELEIKRASGL TRVILVPGNH GERFVREQMG VDTQAVVTMS
NFVGYMIEEA VRLGFCQIVL VGHPGKLIKI AAGIFHTHSH IADARMETLV AHLALLGAPL
ELLTLVSDCD TTEAAMEHIE AYGFGHIYNH LARRICLRVM QMLRFTKTPP VCDAILFSFD
NHILGSNRPV DEIAKELQC