Gene TK90_2031 details


Gene Information

Locus tag: TK90_2031
Symbol:
ID: 8807806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. K90mix
Kingdom: Bacteria
Replicon accession: NC_013889
Strand:
Start bp: 2151401
End bp: 2152729
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: PGAP1 family protein
Protein accession: YP_003461258
Protein GI: 289209192
COG category:
COG ID:
TIGRFAM ID:
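
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2151401
end_bp = 2152729
gene_length_bp = 1329
protein_length_aa = 442

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon that is not translated.
coding_codons = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 1329 bp, which is 443 codons, or 442 residues plus one stop codon.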


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGGCG AGACGCACGA GCCGCTTGTT GGACTGATCC GGGTGGGCAG GAAGGCGGAC 
ATGGACGGAG CAAGGATTGC CGCGCTGGTG CTGGCGGCGG GGTGGCTGAC GGGCTGCGGG
TTGTGGGATG CGCGTGGACA AATGCAGATG ATGGGCCAGG CCTGCACGAT TTCCGGCACC
GTAGTGGCGG ATGACGCGGT CCCGGGGCCG TATGTGGTCG CGGTATTCCG CGCGCCGATG
GAGGAGGGCG CGGTTCCGGA ACCGGTGGAC CATGTCGTGA GTGCCGGTGG CGGCGAGTGG
TTCTTTGGCT TGGCGCCGGG GCGTTACCAG GTCCTGGCCT TTGCCGACCC CGAACGTGAC
GGCCAGCACG AGGCGGGGGC GCCGGTATAC CTGGCGAACC AGGGCGGCAT GCTGGACTGC
CCTGCGGGCA CGCGCTTGGG CAATATGGAG ATCCAGATCG AGGGTGAAGG GGTCGCGGGG
CATGCGATCG CGCTGCCCGT CATGCGCGGT GCCGGTCCGG ATGGCAGTCC CATCAGCGTT
GGGGGGGTCA CGGCGTTCGG GGAAGTGACC ACGCTGGATG ATCCGCGTTT CGACGACGAT
GTCGCGCGCG GGAGCCAGTG GCGGCCGGTG GATTTCATGC TGGCTGGTTA TGCCGGGATC
TATTTCCTGG AGCCCTACGA CCCCGACCGC ATCCCGGTGC TGTTCGTGCA TGGGATGAAT
GGCTCCCCGC GGGGGTTCGC CGAACTCATC GACCAACTCG ATCGCGAGCG CTACCAGCCC
TGGCTGTATT ACTACCCGTC CGGGCTCCCC CTGCAGTCCA TCGCCGCACA CCTGGCCCAG
ACTCTGGAAG AAATCGAGTT GCGCTATGAA GTGGAGTCGC TGCCGGTCGT TGCGCACAGT
ATGGGCGGCC TGGTGGCAAA GGGCTTTCTG CATGAGCGCG CACGTCGCGC GTCGCCGGCC
CATATCCCGC GAATGATTGC GCTGTCTACG CCCTGGCATG GGCATGCGGC TGCGCAGTCG
GGGGTCGATC GCTCGCCGGT GGTGATCCCG GTCTGGCGCG ACATGGTGCC CGGTAGTGAA
TACCAGCGGC GGTTGTTCGA GTCGGAGCTG TTGGAGGAGA CCGAACTGCA TCTGCTGTTC
AGCTTCCGCC GCCCGGAAAG CGGGGCGCGT GCGGGTACGG ACGGCGTGCT CACCCTGGCG
ACCATGCTGT ACCCCCCGAT TCAGGCGATG GCGAGCAGCG TCTATGGGGT GGATACCACG
CATGCGGGGA TTCTCACGCA TCCGATGGCG CTGGAGCGGG TGCAGATGCT GCTCGAGTCT
GGCTCCTGA
 
Protein sequence
MGGETHEPLV GLIRVGRKAD MDGARIAALV LAAGWLTGCG LWDARGQMQM MGQACTISGT 
VVADDAVPGP YVVAVFRAPM EEGAVPEPVD HVVSAGGGEW FFGLAPGRYQ VLAFADPERD
GQHEAGAPVY LANQGGMLDC PAGTRLGNME IQIEGEGVAG HAIALPVMRG AGPDGSPISV
GGVTAFGEVT TLDDPRFDDD VARGSQWRPV DFMLAGYAGI YFLEPYDPDR IPVLFVHGMN
GSPRGFAELI DQLDRERYQP WLYYYPSGLP LQSIAAHLAQ TLEEIELRYE VESLPVVAHS
MGGLVAKGFL HERARRASPA HIPRMIALST PWHGHAAAQS GVDRSPVVIP VWRDMVPGSE
YQRRLFESEL LEETELHLLF SFRRPESGAR AGTDGVLTLA TMLYPPIQAM ASSVYGVDTT
HAGILTHPMA LERVQMLLES GS
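
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a hedged example, not the pipeline that produced this record. It uses the codon assignments of translation table 11, which are the same as the standard genetic code, and it assumes the input is a complete CDS beginning at its initiation codon, so the first codon is always read as M (this is why the GTG start here yields M rather than V). The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, until the first stop codon.

    Assumption: the sequence starts at its initiation codon, so the first
    codon is emitted as M even when it is an alternative start such as GTG.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0:
            aa = "M"  # initiation codon is read as methionine
        elif aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGGCGGCG AGACGCACGA GCCGCTTGTT"))  # MGGETHEPLV
```

The printed residues match the first block of the protein sequence listed above. Passing the full gene sequence, with its spaces and line breaks left in, should work the same way, since whitespace is stripped before translation.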