Gene Paes_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2008 
Symbol 
ID6459833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2205183 
End bp2206526 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content48% 
IMG OID642725991 
Productprotein of unknown function DUF21 
Protein accessionYP_002016665 
Protein GI194334805 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000403932 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.229254 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCAG ATATTTTCGA ACTTTTTATT CTTCTTTGCC TCATACTGGC AAACGGCTTT 
TTCTCCATGG CGGAATTCGC CATTATCTCA TCAAGGGAGA CCAAATTACA TGAATTGCAC
GAAGCCGGTG TTTCGAGAGC GGGCCTTGTC CTCGAACTGC TTGATAACCC CGGAAAATTT
CTTTCGGCCA TTCAGGTAGG GATTACCCTG ATCGCAACAC TCGCAGGGGC ATTCAGCGGC
ATCACTCTGT CTGCGCCCAT AGCGGAAATG ATCGAGCGTG CAGACGCGCT CAAACCATAC
AGCAATGAGC TTGCTCTTGG CCTCGTGGTT ATAGGCGTCA CCTACTTCAC CCTGATTATC
GGTGAACTTG CCCCGAAAAA AATAGCCCTG CAACACCCTG AAAAAATCGC ATTGTCTGTT
GCAAAAATCA TAGACATCAT CTGCAGGGTC ATTGCGCCGA TCGTACACCT GATCAACGGA
TCAACCAACA TCGTACTGAA AATCATGGGC ATCAAACCAA CCGAAAAGCC CACGGTAAGC
GACGAAGAAG TGATGCTGCT GCTCAAGCAG GGAGCAAAAA AAGGGGTGTT TGAATCGGTC
GAATACGATA TGGTTTCACG AATTTTCCGG ATGAGCGACA AACGGGCAAA TTCGATGATG
ACCCCCAAGA GCGAAATAGA GTGGCTGGAT CTTTATGCCA CCGAAGAAGA GCTCATTTCG
AAAATGCAGG CCAGTGGCCG ATCGAGATTT CCTGTCTCAG AAGGCAGTCT CGATAACCTG
AAGGGAGTCG TTCGCTCGCT CGATCTGGTC AACAAGCAGC TCCTGAGCCA GGGCAATCTG
AAGGATGCCA TCCGCAATGC GATGAAAGCC CCGCTCTTTG TTCCTGAATC GATCCCTGCG
TTTCAGGTTC TCGAACTTTT CAAGGAAAAC CGGGCTCACC TTGCACTGGT TGTCGATGAA
CAGGGTTCGG TGCAGGGAGG AATAACAATC ACCGATGTCC TTGAAAGCAT TGTAGGCGAT
ATTCCGGCCG ATGACATCGA AGGAAACCGC AAAATCGTAC GCCGGAGTCA GCGGACATGG
ATCATTGACG GACTGCTGCC GGTCGATGAT TTCATTCAGG AATTCCATCT TGAAAACTTT
CTGGATGAAG ACAATCCGCT CTATGATACC ATGGGGGGGT TCATGATGAC GAAACTTGAA
AAAGTCCCTT CTGTCATGGA TATACTCGAA TGGCAGGGGA TACTCTTCAA AGTCATTAAA
ATGAATAAAC AGCGGGTAGA CAAAATCCTG GCTGTTTTCA ATAACGACGC CCACGATAAA
GCGTCAAAAT ACGATACGAA ATGA
 
Protein sequence
MDSDIFELFI LLCLILANGF FSMAEFAIIS SRETKLHELH EAGVSRAGLV LELLDNPGKF 
LSAIQVGITL IATLAGAFSG ITLSAPIAEM IERADALKPY SNELALGLVV IGVTYFTLII
GELAPKKIAL QHPEKIALSV AKIIDIICRV IAPIVHLING STNIVLKIMG IKPTEKPTVS
DEEVMLLLKQ GAKKGVFESV EYDMVSRIFR MSDKRANSMM TPKSEIEWLD LYATEEELIS
KMQASGRSRF PVSEGSLDNL KGVVRSLDLV NKQLLSQGNL KDAIRNAMKA PLFVPESIPA
FQVLELFKEN RAHLALVVDE QGSVQGGITI TDVLESIVGD IPADDIEGNR KIVRRSQRTW
IIDGLLPVDD FIQEFHLENF LDEDNPLYDT MGGFMMTKLE KVPSVMDILE WQGILFKVIK
MNKQRVDKIL AVFNNDAHDK ASKYDTK