Gene CPS_4574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_4574 
Symbol 
ID3520060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp4827660 
End bp4828871 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content38% 
IMG OID637287014 
Productputative MSHA biogenesis protein MshN 
Protein accessionYP_271222 
Protein GI71279601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTTA TTAATCAGAT GTTAAAGGAT TTAGAACAGC GCAGTCCTGA GTCTAATACC 
GATGCTACTC AGTCAGGTAA TGTTGCAGTA GCCCATTCCC CTATAAAAAT AGCGCTCGTT
ACTGGGTTTT GTGTGTTAGC CGTTTGTTTT CTTAGTTTTT ATGTTTGGCA ATTAATTAGT
GAAAATAACG CATTAAAAGC TGAAAAAATA ACGAATAAAG TTAATGCCGT TCAAATGAGT
TCAGCAAAAA ATAGACCTGA AAATATAAGT AGCCAGATTA ATACGTCTAA GCAAATAAGC
AGCAACGAAA ATACCGTTCA GAATGATCCA ATAAACGTAC ATGTCACTAA AATTTATGAT
CAACAGGAAA TAGCACCTAT AAATGGTCAA ATTGCTGAAC CAACAGATGT AAATAAAGTA
TTATCAAATA ACAGTGCTGA GACTACTGCC AAGTTAATAA CGGCAAAACC TTTGGTAAAT
AACAGCGCTA GCCAAGTAAC ACCAGTAAAG AAAGCGAAAG TTATAGCGGA TACCCATAGT
CATTCGGGAG ATAGTTCAGG TCATAGCCAC GACATTGTTG ATATCGTCAA AGCTAAACCT
AAGCCAAAAG TAAATAAAAT GTCGGTGTCA CGACGTCAAT TATCGGCGGA TGAACTAGCA
GAGCAAAAAT TAGTCCTCGC TGAAAAAGCA CTAGCGGCTA AGCAAATCGA GAAGGCCGAA
AAACTACTAG AAGATGTAGT CATTATCAGG CCGAGCGATA GTCAAACACG TAAAAAACTG
GCGGCTTTAT GGTTTGGCCG TCAAGCTTAT CAAGATGCTG TGAATTTATT GTCACAAGGC
ATCGCCTTAA ATGGTAAAGA CAGCAGTTTA CGTCAAATGA AAGCGCGCAT TCATTTAAAG
CAAGGGCAAT TCACGGCTGC GCTGAATACG TTAAAACCTC TTGCTCAATT AAAAGATGAG
CAATATCAAG TCATGCTGGC AAATACCGCA CAGCAAGCCA AACAAAATAA AATAGCCGTT
GATGCGTATA AAATGTTAAT AGCAATGAAA CCGGATATAG GCCGTTGGCC GCTAGGTTTA
GCCGTTTTGT ACGATAAAAA CAGCCAGTTT GAGTTGGCCA GTATGGCTTA TAAAAAAGCA
TTAACAAAAA ATGATTTATC AGTTTCTTCA GAAAACTTTG TTAAGCAACG CTTACAAGTA
ATAGGACAGT AG
 
Protein sequence
MSVINQMLKD LEQRSPESNT DATQSGNVAV AHSPIKIALV TGFCVLAVCF LSFYVWQLIS 
ENNALKAEKI TNKVNAVQMS SAKNRPENIS SQINTSKQIS SNENTVQNDP INVHVTKIYD
QQEIAPINGQ IAEPTDVNKV LSNNSAETTA KLITAKPLVN NSASQVTPVK KAKVIADTHS
HSGDSSGHSH DIVDIVKAKP KPKVNKMSVS RRQLSADELA EQKLVLAEKA LAAKQIEKAE
KLLEDVVIIR PSDSQTRKKL AALWFGRQAY QDAVNLLSQG IALNGKDSSL RQMKARIHLK
QGQFTAALNT LKPLAQLKDE QYQVMLANTA QQAKQNKIAV DAYKMLIAMK PDIGRWPLGL
AVLYDKNSQF ELASMAYKKA LTKNDLSVSS ENFVKQRLQV IGQ