Gene Paes_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1950 
Symbol 
ID6459982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2131469 
End bp2132512 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content54% 
IMG OID642725935 
ProductTIM-barrel protein, nifR3 family 
Protein accessionYP_002016609 
Protein GI194334749 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTCG GGTCATTGGA CATCGGTCAG GCGATTATGC TCGCGCCTAT GGAAGAGGTG 
ACTGATCGTT CTTTCAGGAA AATCTGTAAG CGGTTCGGGG CAGATGTTGT CTATACTGAA
TTTATCAGTG CCGAAGCGAT CCGTCGACAG GTCGATACGT CGCTCCGTAA AATGCGGTTT
GACGATGATG AGGCTCCTGT TGTCGTTCAG ATTTTCGGCA ACAGCCAGGA AGCTATGGCT
GAGGCTGCTG TCATTGCGGC ATCGTGCAGC CCGGCATGGA TCGATATCAA TTTTGGTTGT
CCCGCAAAAA AAGTTGCCGG GAAAGGGGCC GGTGCTGCCC TGCTGAAAGA ACCCGATAAA
ATGGTCGCGA TTGCCGCTGC GGTTGTCCGT GCGGTCAATC TGCCTGTTAC GGTCAAAACG
CGTCTTGGCT GGGATCGCGA TTCAATCAAT ATTGTCGATA TCGTGCCCCG TCTGGAGGAT
GTCGGGGTTC AGGCTCTGGC GATTCACGGG CGGACGCGCA GCGAGATGTA CAGGGGCGTG
GCCGATTGGG AGTGGATCCG AAAGGTGACG GAGAAGGCCA GAATACCGGT TATCGCCAAT
GGCGATATAT GGAGTGCAGC GGATGCCAGG GCGATGTTTG CCGAGACCGG GGCTGATGCT
GTCATGATAG GACGAGGAGC GATTGGTAAT CCTTTTATCT TCGCTCAGGC CAGGGAGCTT
CTCGATACGG GGCGCGTGGT GACGCATCCT GATTACAGGG ATCGCATTGC CGTTGCGGTC
GAACATCTGA AACTGTCGGT TGAATTCAAG GGGGAGAAGT ACGGAAGCCT TGAAATGCGG
AGGCATTACT CAACCTATCT GAAAGGACTT CCGCGGGTTT CCAGGGTTCG CGACAAGCTG
GTTCGCGAAC CGGACTGGAG AAACGTCATC GAAATCCTTC AGGCTTACGA GGTCGAATGC
GAAGGCTATG AACGCGAAGG AAAGATCCGC GATTATGCGG AATTTCTCAA TGACCATTCG
AAGCGATTGA CATTAAACTA TTAG
 
Protein sequence
MKVGSLDIGQ AIMLAPMEEV TDRSFRKICK RFGADVVYTE FISAEAIRRQ VDTSLRKMRF 
DDDEAPVVVQ IFGNSQEAMA EAAVIAASCS PAWIDINFGC PAKKVAGKGA GAALLKEPDK
MVAIAAAVVR AVNLPVTVKT RLGWDRDSIN IVDIVPRLED VGVQALAIHG RTRSEMYRGV
ADWEWIRKVT EKARIPVIAN GDIWSAADAR AMFAETGADA VMIGRGAIGN PFIFAQAREL
LDTGRVVTHP DYRDRIAVAV EHLKLSVEFK GEKYGSLEMR RHYSTYLKGL PRVSRVRDKL
VREPDWRNVI EILQAYEVEC EGYEREGKIR DYAEFLNDHS KRLTLNY