Gene Paes_1670 details

Gene Information

Locus tag: Paes_1670
Symbol:
ID: 6459544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1817692
End bp: 1819236
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 50%
IMG OID: 642725658
Product: glycosyl transferase family 39
Protein accession: YP_002016335
Protein GI: 194334475
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
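
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1817692
END_BP = 1819236


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1545
print(protein_length(length_bp))  # 514
```

Both results agree with the Gene Length (1545 bp) and Protein Length (514 aa) fields of the record.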


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000109819
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.252715
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCCTA AAGATAACGC ACACTCATTG CAGGACATGC TCATACTGGC TTTTCTCTGC 
ATTGTCAGTT TCTATGCCTG GACGGGCTCC GTGCCGCTCT TTGACGTCGA TGAAGGAGCA
TTCAGTGAAG CGACAAGAGA GATGCTGAAA AGCGGAAACT ACCTGACAAC CTATCTCAAC
GGTGAACCGC GTTTCGACAA GCCAATTCTG ATCTACTGGC TGCAACTGGC AAGCATATCC
CTTTTCGGCA TCAATGAATT CGCTTTCCGC CTTCCTTCGG CACTGGCATC GACGCTATGG
GCAGCATCGA TCTACCTCTT TGCTGGCAAG ATGCTCGATC GCCGCGCAGG ATTTATTGCC
GCAGCGGCAA TGATCCCCAC CCTGCAGATC ACAATGATCG CCAAAGCAGC CATTGCCGAT
GCCCTGCTCA ACAGCATGCT CGCCATCAGC ATGTTCTCCA TTTTTCTCTA TTACAAAGAA
CGACAGCAGC GCTTCGTGCT CATCGCTTTT ACTGCAATAG GGCTCGGCAC TCTGACCAAA
GGACCGGTTG CCATCCTCAT CCCGCTTGTT GTCTCGGCAC TCTTTTTTCT AAGCCAAAGA
GAAACAAAAG CCTGGCTGAA AGCGATATTC AATCCTTCAG GCATCGCACT GTTTTTGCTT
ATCGTTCTGC CATGGTACAC CCTGGAATAT CTTGATCAGG GAATGCGCTT CATCGAAGGC
TTTATCCTCA AACACAACAT AGAGCGCTTC AGCACCCCTT TTGAACAGCA CACCGGCTCA
ATCTGGTACT ACCTCCCGGT CCTCCTGGCT GGATTAACGC CCTCAACGGG GCTCATCATC
CCTCTGGCGA CAGAGATAAA AAAACTCGTC AAAACGCCCC TCTCAGCCTA TCTGCTGATC
TGGTTCGGAT TCGTATTCCT CTTTTTTTCT TTCTCCGGAA CGAAACTCCC CCACTATATC
ATCTACGGCT ATACCCCGCT CTTCATCCTC TTTGCCATGG TCTTCAAGAC GATAAAAAAA
CCATGGCTGC TTGGAATCTG GCCCGCTCTG ATCCTCGTCG CACTCTCGAT GGCCCCGTTG
TTCATCGCCA GAATTGCGCT ATCAGCGAGC AACCCCTATA TACGCGATCT CCTGCATGGT
GCCCTGAAGC TCATGGGCCC CGAATACACA ATCCTGCTTG TCTCCATTGC CCTCATAACC
CTGCTTGTCT GGACAGCCGG AAAGATCCCT GCCCGTTACA GACTCGCCGC AACAGGAGCA
ATCTTCTGCC TCGCCTTCAA CCAGGTTATC ATGACTCGAG CCGGAGAACT GCTCCAGCAG
CCAGTCAAGG AAGCTGCTCT TCTGGCCAGG AACAACGGCT ACAAAATCGT CATGTGGAAA
GTCAACTACC CCTCTTTTTT AGTATATTCG GGAAATTCAG TTGAAAAAAG AGCCCCCAAA
CCAGGTGAGA TTGTCTTCAC ATCGGTCAAA TATATCGACC GGTTGAACGC CAGCGAGATT
CTCTATCAGA AAAACGGTCT GGTTCTTGCA AAAATCCAAC AATAA
 
Protein sequence
MQPKDNAHSL QDMLILAFLC IVSFYAWTGS VPLFDVDEGA FSEATREMLK SGNYLTTYLN 
GEPRFDKPIL IYWLQLASIS LFGINEFAFR LPSALASTLW AASIYLFAGK MLDRRAGFIA
AAAMIPTLQI TMIAKAAIAD ALLNSMLAIS MFSIFLYYKE RQQRFVLIAF TAIGLGTLTK
GPVAILIPLV VSALFFLSQR ETKAWLKAIF NPSGIALFLL IVLPWYTLEY LDQGMRFIEG
FILKHNIERF STPFEQHTGS IWYYLPVLLA GLTPSTGLII PLATEIKKLV KTPLSAYLLI
WFGFVFLFFS FSGTKLPHYI IYGYTPLFIL FAMVFKTIKK PWLLGIWPAL ILVALSMAPL
FIARIALSAS NPYIRDLLHG ALKLMGPEYT ILLVSIALIT LLVWTAGKIP ARYRLAATGA
IFCLAFNQVI MTRAGELLQQ PVKEAALLAR NNGYKIVMWK VNYPSFLVYS GNSVEKRAPK
PGEIVFTSVK YIDRLNASEI LYQKNGLVLA KIQQ
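
The protein sequence can be cross-checked against the gene sequence by translating codons. The following is a minimal sketch rather than a full implementation: it builds the codon table by hand (the `translate` helper and the constant names are hypothetical), and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, differing only in which alternative start codons it permits. Because this gene begins with ATG, that difference does not matter here. Only the first 30 and the last 45 nucleotides of the gene sequence are translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 45 nt of the gene sequence in this record.
first_30 = "ATGCAGCCTA AAGATAACGC ACACTCATTG"
last_45 = "CTCTATCAGA AAAACGGTCT GGTTCTTGCA AAAATCCAAC AATAA"

print(translate(first_30))  # MQPKDNAHSL
print(translate(last_45))   # LYQKNGLVLAKIQQ
```

The two outputs match the first ten and the last fourteen residues of the protein sequence, and the final TAA codon is the stop that the 514 aa protein length leaves out.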