Gene Gobs_0698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_0698 
Symbol 
ID8752355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp743203 
End bp744324 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content69% 
IMG OID 
ProductCapsule synthesis protein, CapA 
Protein accessionYP_003407846 
Protein GI284989292 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCAG GGCGGCACCG GTCGCCGCGG CCGGCCCGGC CCATGCGTGT GCCCGTGCTG 
CTGGTGGCCC TGATTTTCGC GCTGGCCGCC GCGTTCTCGC TGGGCAGCAG CACCGAGAAG
TCCGCACCCT TGACGTCGGG CAGCTCGCCG CCGCCGCGCG ACACCAGTGT CGGTATCAGT
GCGGTGGGCA ACGTCATCAT GGGATCGACC CCCGAGTTAC CTCCCGATGG CGGGCGGCAC
CTGTTCGACG GGGTGGCCGA CCGGCTGGCC GGAGACGTGG TGCTGGCCAA TCTGGATCAG
GCGCTCACCG ATGCAGCGGC CTCGACCAAG TGCGGAGCAG ACAGCAGTAG CTGCTATGCG
TTCCGCACGC CGCCCTCGTA TGCCCGGTGG CTGCGCCAGG CCGGTTTCAC GGTGATCAAT
CTGGCCAACA ACCATTCGCG CGACTTCGGC GATGCCGGGC TGCGCGACAC TCAGGCGGCG
CTGACCGCTC ACAATCTGCA GTACACCGGC ATGCCGGGGC AGATCACGCT GCAGGACGTC
GGCTCGGTGC GGGTGGCGAT CCTCGGCTTC GCGCCCTATC ACTGGGCACA AAGCCTGCTC
GACATTCCCG CCGCCCAACA AATGGTGCGG CAGGCTGCTG CCCAGGCCGA TCTGGTCCTG
GTCACCATCC ACGCCGGCGC CGAGGGCGCC GACCGCGGGC ACGTACCGCC GGGCACCGAG
GTTTTCCTCG GCGAGGACCG CGGTGATGCG GTCGCGTTCT CCCACGCGGC CATCGATGCC
GGCGCTGATG CGGTGCTCGG TGCCGGTCCG CACGTGTTGC GGGGCATGGA GTGGTACCGA
GGTCGCCTGA TCGCCTACAG CCTGGGCAAC TTCCTGGGCT ACGAGACGCT GTCGCACACC
GGAGCACAAG GGGTGGGCGG CATCGTGACG CTGCAGCTGA CGCCCGATGG CAGCTGGCAC
AGCGGACAGC TGGAGGGCAC CGTCATGGTC GCCCCGGGAG TGCCGCAGAT CGATCCCGAC
CAGCGCGCCC GCGCACTCGT GCAGGAGTTG TCCCGCACCG ACTTCGGCGC CTGCGGCGTG
CAGCTCTCCG CCGCCGGTGA ACTGAACACC CCCACCTGCT GA
 
Protein sequence
MKAGRHRSPR PARPMRVPVL LVALIFALAA AFSLGSSTEK SAPLTSGSSP PPRDTSVGIS 
AVGNVIMGST PELPPDGGRH LFDGVADRLA GDVVLANLDQ ALTDAAASTK CGADSSSCYA
FRTPPSYARW LRQAGFTVIN LANNHSRDFG DAGLRDTQAA LTAHNLQYTG MPGQITLQDV
GSVRVAILGF APYHWAQSLL DIPAAQQMVR QAAAQADLVL VTIHAGAEGA DRGHVPPGTE
VFLGEDRGDA VAFSHAAIDA GADAVLGAGP HVLRGMEWYR GRLIAYSLGN FLGYETLSHT
GAQGVGGIVT LQLTPDGSWH SGQLEGTVMV APGVPQIDPD QRARALVQEL SRTDFGACGV
QLSAAGELNT PTC