Gene TM1040_1769 details

Gene Information

Locus tag: TM1040_1769
Symbol:
ID: 4076798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1860620
End bp: 1861903
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 63%
IMG OID: 638007084
Product: VWA containing CoxE-like
Protein accession: YP_613764
Protein GI: 99081610
COG category: [R] General function prediction only
COG ID: [COG3552] Protein containing von Willebrand factor type A (vWA) domain
TIGRFAM ID:
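
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, so treat both as assumptions. The function names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 1860620
end_bp = 1861903

def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1284, matching "Gene Length"
print(protein_length(length_bp))  # 427, matching "Protein Length"
```

Under these assumptions the 1284 bp span gives 428 codons, one of which is the stop codon, which agrees with the 427 aa protein length in the record.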


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.606293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGAAT ACCCCTCGCT TGCGATCTCG GACGATCCCA AACTCGCCGC GAATATCACC 
CATTTTGCGC GAGCTCTGCG CAAGGCCGGA CTCAATGTTG GTACCGGGCG GGTGCTCGAT
GCGATCAGGG CTGTTGAGGC CGCCGGTTTC ACGTCGCGGC GCGATTTTTA CTGGACCTTG
CACGCGTGTT TTGTCTCCCG TCCCGAAGAG CGCGTGGTCT TTGGGCAGGT GTTTCGTCTC
TTCTGGCGTG ACCCCCGGTT TCTGGAACAT ATGATGGCGG CGATGCTGCC CGCGATCCGA
GGCGTACAGC AGGAACGCGC GGCCAAACCT GCCGAGACCC GTGCTGCCGA GGCATTGCTG
GATGGGCAAC TGCCCGAACA TCCCGAGGAA TCACCCGAGG CCACGGACGA AGCAGAGGAG
ATCGAAATCG ACGCCGCAAT GACCCTCTCG GCCGAAGAGC GGCTCAAGAC GCTTGATTTT
GAACAGATGA CCACCGAGGA AATCCAGCAG GCCAAGCGGA TGCTTGCCAC ACTGAGATTG
CCGATTGCGC CTCTGAAAAC GCGCCGCCAT CAACCCGCGC CGCAAGGTGC AAGGCCGGAT
TGGCGGCGCA CGATGCGTGG TGCGGGCCGC ACCGGGGGCG AAATCGCGCG CATCGCCCGC
AGCAAACGCG CCGAGCGGTT TCCAAATCTT GTGGTTCTCT GCGACATTTC CGGCTCCATG
AGCCAGTACA GCCGCATGGT GCTGCATTTT CTGCATGCGG TCGCCAATCG ACCCGCTGAC
GGGCGGCAGG GGCGCTGGGC GCAGGTGCAT GGTTTCACCT TCGGCACCCG GCTCACCAAT
ATCTCGCGAC ATCTGAAGCA ACGCGATGTG GATGCGGCGC TCGCGGCGGC GGGTGCCGAG
GCGCAGGACT GGCAGGGAGG GACGCGGATC GGTGGGTGCC TGCATGCCTT CAACCGGGAC
TGGTCGCGCC GAGTCATGGG GCAGGGGGCG GTGGTGCTCC TGGTAAGTGA TGGGCTGGAC
CGCGACGTCC CAGAGACCCT GGCGCTGGAG ATGCAGCGCC TGCGGCTTTC TGCAGGGCGT
CTTGTCTGGC TCAACCCTTT GCTCCGGTGG GATGGGTTCC TGCCACGCGC CCGCGGCATC
CAGGCGATGC TGCCCCATGT GGACAGCTTT CGCGCAGGTC ACAATATTGC GTCGCTTGAA
GATCTTGCAC AAGCGCTCTC GCGGCCGGAT GACACTGGCG AAAAGCTGCG CCTGATGGCC
ATGATGCAGG AGGAACGCGC GTGA
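
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch of that clean-up together with a GC-percentage calculation of the kind reported in the "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its figure is not stated in the record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()

def gc_percent(seq):
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)

# First line of the gene sequence, copied as printed above.
first_line = "ATGGTTGAAT ACCCCTCGCT TGCGATCTCG GACGATCCCA AACTCGCCGC GAATATCACC"
seq = clean_sequence(first_line)
print(len(seq))         # 60 bases on one printed line
print(gc_percent(seq))  # GC percentage of this line only, not of the whole gene
```

Pasting the full gene sequence into `clean_sequence` should give a string of 1284 bases whose GC percentage can be compared with the value in the record.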
 
Protein sequence
MVEYPSLAIS DDPKLAANIT HFARALRKAG LNVGTGRVLD AIRAVEAAGF TSRRDFYWTL 
HACFVSRPEE RVVFGQVFRL FWRDPRFLEH MMAAMLPAIR GVQQERAAKP AETRAAEALL
DGQLPEHPEE SPEATDEAEE IEIDAAMTLS AEERLKTLDF EQMTTEEIQQ AKRMLATLRL
PIAPLKTRRH QPAPQGARPD WRRTMRGAGR TGGEIARIAR SKRAERFPNL VVLCDISGSM
SQYSRMVLHF LHAVANRPAD GRQGRWAQVH GFTFGTRLTN ISRHLKQRDV DAALAAAGAE
AQDWQGGTRI GGCLHAFNRD WSRRVMGQGA VVLLVSDGLD RDVPETLALE MQRLRLSAGR
LVWLNPLLRW DGFLPRARGI QAMLPHVDSF RAGHNIASLE DLAQALSRPD DTGEKLRLMA
MMQEERA
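
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual 64-letter amino acid string in TCAG order. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; because this gene begins with ATG, the sketch does not special-case alternative starts, which is a simplifying assumption. The function names are illustrative.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its amino acid; "*" marks a stop codon.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()

def translate(text):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)

# First 30 bases and last 24 bases of the gene sequence, as printed above.
print(translate("ATGGTTGAAT ACCCCTCGCT TGCGATCTCG"))  # MVEYPSLAIS
print(translate("ATGATGCAGG AGGAACGCGC GTGA"))        # MMQEERA
```

The two fragments reproduce the first ten and last seven residues of the protein sequence shown above, including the terminal TGA stop codon that is not represented in the protein.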