Gene Dshi_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1371 
Symbol 
ID5712547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1424219 
End bp1425220 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content67% 
IMG OID641267283 
ProductABC transporter related 
Protein accessionYP_001532714 
Protein GI159043920 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.138832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGTC GCGAAGAAAT CGACGGGTTC GACCACAGCC CAGAGCCACC GAAACCCGCG 
CCCGCACGCG CGCCGAAGGT CACCCCGTCG CAGCCCCTTG CGGGGGAGGC GGACCTGGCC
AGCCGCAAGG GCAATTGGCG CTACCTCGCG CCGGAGGCCG CGGCGCCCCA GGCCGACGCC
GAGGCACCGG CCGCGGCGCC CGAGCCGACG CGCGCGACGG ATGCGCCTGC GACCGTGATC
TCGGCGCGGG ATCTGGACCT GGTGTTCCAG ACCAACGATG GTCCGGTTCA TGCGCTGTCG
GGCGTCAATC TTGAGATCGG CAAGGGCGAG TTCGTCAGCT TTATCGGCCC CTCGGGCTGC
GGCAAGACCA CCTTCCTGCG CGCGGTCGCA GGGCTGGAGC ATCCCACGGG CGGCTCGCTC
ACGGTCAATG GCATGACCCC GGACGAGGCC CGGCAGGCCC GCGCCTATGG CTACGTCTTC
CAGGCGGCGG GGCTTTATCC GTGGCGCACC ATCGCCAAGA ATATCTCCCT GCCGCTTCAG
ATCATGGGCT ATTCCAAGGC CGATCAGGAG GCGCGCGTTG CCCGCGTGCT GGAGTTGGTG
GAGTTGTCGG GCTTTGCCAA GAAATACCCC TGGCAGCTGT CGGGGGGCAT GCAGCAACGC
GCGTCCATCG CGCGGGCGCT GTCCTTCGAT GCCGATATCC TGCTGATGGA CGAACCCTTC
GGAGCGTTGG ACGAGATCGT GCGCGACCAC CTCAACGAGC AGTTGCTCGC CCTGTGGAAG
CGCACCGAGA AGACCATCGG CTTCGTCACC CATTCGATCC CCGAGGCGGT CTATCTCAGC
ACCAAGATCG TGGTGATGTC CCCGCGCCCG GGCCGGATCA CGGATGTGAT CGACAGCCCG
CTCCCCCTCG ACCGCCCGCT CGACATCCGC GACACGCCGG AATTCATCGA GATTGCCCAC
CGCGTCCGCG AGGGCCTTCG GGCGGGGCAT CTGGATGAGT AG
 
Protein sequence
MLSREEIDGF DHSPEPPKPA PARAPKVTPS QPLAGEADLA SRKGNWRYLA PEAAAPQADA 
EAPAAAPEPT RATDAPATVI SARDLDLVFQ TNDGPVHALS GVNLEIGKGE FVSFIGPSGC
GKTTFLRAVA GLEHPTGGSL TVNGMTPDEA RQARAYGYVF QAAGLYPWRT IAKNISLPLQ
IMGYSKADQE ARVARVLELV ELSGFAKKYP WQLSGGMQQR ASIARALSFD ADILLMDEPF
GALDEIVRDH LNEQLLALWK RTEKTIGFVT HSIPEAVYLS TKIVVMSPRP GRITDVIDSP
LPLDRPLDIR DTPEFIEIAH RVREGLRAGH LDE