Gene Ppha_0520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_0520 
Symbol 
ID6461987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp509135 
End bp511078 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content55% 
IMG OID642726806 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_002017461 
Protein GI194335667 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGGA GGGCTCTTGC CTCTTCCCGC TATGTCAAGC TGCTGGTTGT TGTCGCATTC 
GACATCTTCG CAGCGCTGCT AACGGTCTGG ATGGCATTCT CATTCACCCG TCAACTCCTG
CATCAGCCGG AGGGGCTGGA GTGGCTGCTC TATCTGCTTG CGCCAGCTCT GATGCTCCCG
ATCTTTGCTC TTTCAGGCAT CTACGGCACC ATCCAGCGTT ATAACAGCTT TGAAGGATTT
ATCACCGTTT TCAAAAGCGT TTTTTTCTAC GGCGCCGCAT TTGTATGCCT TTTGCTGCTC
CTGCCTCTTC CGAAGATCTC CATTGTCGTC GGAATCCTTC AGCCGCTGCT GCTCCTGCTG
ATGACCGGGG GAAGCCGTGC AATGGTGCGC TATCTGAGTA CAACAGTAAC CACGACGCCT
GAAAAAAGGT GTGAAGCGCC CAACAAGCTC CTGATCTACG GAGCAGGTTC CGCAGGGGTG
GAGATTGTCA ATAGCATCAA CAGAAGCAAT AAATTTACCC TTGCCGGTTT TATTGACGAC
AATCCGGATT TGCAGGGGCG AACCATCAAC CGGATGAAGG TCTTCAGCCC TGCGGAGGCC
GAAGAGCTGA TCCAAAGCGA GGGAATCAAC AACATCCTTC TCGCAATGCC ATCGGCAACT
CGTACCCGGC GCAACGAAAT TGTTGAGCAA TACCGAAAAT ACCCGGTGCG TATCCAGATG
CTGCCGGGAG TTGAAGAGCT TGCCGAAGGA AAGGTCACCA TCTCCGACAT CAAGACCGTA
GACATCGAAG ATCTCCTTGG TCGTGATCCG GCGCCGCTCG ATCACGCCCT CGTCAAACAG
GTAATTACCG GCCGGGTTGT AATGGTCACC GGCGCCGGAG GCTCAATCGG CAGCGAACTC
TGCCGCCAAC TCCTTCAGGC CCAGCCATCA ACACTGCTGC TGCTCGACAA CGCCGAACAC
AACCTCTACA CCATTCACAG CGACCTTACC CAGCGCCACG CACGCCTCTC ATCCGCAACG
CGGATAGTGC CGCTGCTCTG CGACGTGACC AATGCAATCC GTATAACAGA AATATTCCGC
GTTTTTCAAC CTGAAGTGAT CTACCATGCC GCAGCCTACA AGCATGTGCC GATGGTCGAG
CACAACCCCG CAGAAGGGAT ACGCAACAAC GTTTTCGGAA CCCATACGGT GGCAGAAGCT
GCCCTGCAGC AGGGAGTAAC AAGCGTCGTC CTCGTCAGTA CCGACAAGGC AGTGCGCCCA
ACCAACGTGA TGGGCGCAAG CAAACGGCTC TGCGAAATGA TCCTGCAGGC ACTTGCCGAG
GAGCCCGGCC ATACAACCTG CTTCTCCATG GTGCGCTTCG GCAATGTCCT CGGCTCAAGC
GGTTCCGTTG TCCCTCTTTT CCGCAGTCAG ATCAACAGCG GTGGCCCCTT CACCATCACC
CACAAGGAGA TTACCCGCTA CTTCATGACC ATCCCCGAAG CGGCACAGCT TGTCATTCAG
GCAGGCGCCA TGGCCTCACC AGGCGATCTT CATCTCCTCG AAATGGGCGA CCCGGTCAAA
ATCATCGACC TTGCCCGAAA GATGGTAGAA CTCTCCGGCC TCACCATTCG CGACCATGAA
AACCCTGAAG GTGACATTGA AATACGGGTA ACCGGCCTTC GCCCAGGCGA AAAACTCTAC
GAAGAGTTGC TGATCGGCAA CCATTCAAGC CCTTCAGCCA ACCCCCGCAT TTTCAAAGCC
AAGGAGCACT TCATACCCTG GAGTGAACTG CAGGAAGAAC TTGAGCAACT GACCACAGCC
ATACACAGCA ATGACATCAA GAGCATCAAA AAGATCCTGA AAAAGATAGT CCCCGAATAC
AGCCCAGGGA TCACCACCTC AGACCTGCTC TCCATGGAAA AGCAAAGCAA AAACGGCAAA
AACCACCACA TACCAGCCCA GTAA
 
Protein sequence
MGRRALASSR YVKLLVVVAF DIFAALLTVW MAFSFTRQLL HQPEGLEWLL YLLAPALMLP 
IFALSGIYGT IQRYNSFEGF ITVFKSVFFY GAAFVCLLLL LPLPKISIVV GILQPLLLLL
MTGGSRAMVR YLSTTVTTTP EKRCEAPNKL LIYGAGSAGV EIVNSINRSN KFTLAGFIDD
NPDLQGRTIN RMKVFSPAEA EELIQSEGIN NILLAMPSAT RTRRNEIVEQ YRKYPVRIQM
LPGVEELAEG KVTISDIKTV DIEDLLGRDP APLDHALVKQ VITGRVVMVT GAGGSIGSEL
CRQLLQAQPS TLLLLDNAEH NLYTIHSDLT QRHARLSSAT RIVPLLCDVT NAIRITEIFR
VFQPEVIYHA AAYKHVPMVE HNPAEGIRNN VFGTHTVAEA ALQQGVTSVV LVSTDKAVRP
TNVMGASKRL CEMILQALAE EPGHTTCFSM VRFGNVLGSS GSVVPLFRSQ INSGGPFTIT
HKEITRYFMT IPEAAQLVIQ AGAMASPGDL HLLEMGDPVK IIDLARKMVE LSGLTIRDHE
NPEGDIEIRV TGLRPGEKLY EELLIGNHSS PSANPRIFKA KEHFIPWSEL QEELEQLTTA
IHSNDIKSIK KILKKIVPEY SPGITTSDLL SMEKQSKNGK NHHIPAQ