Gene Meso_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMeso_3643 
Symbol 
ID4181990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChelativorans sp. BNC1 
KingdomBacteria 
Replicon accessionNC_008254 
Strand
Start bp3935132 
End bp3936733 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content61% 
IMG OID638069537 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_676177 
Protein GI110635969 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.166369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTCT CGAGACGAGG ATTTTTGTGC CGCTCCGCCG CCGCAGGCGC CCTTCTTTCC 
ATGCCGCTCA AGGCGTTCGC CCAGTCGGGA GAAATGGGCA TGCTGCGAGT GGCGGTTTAC
ACGGACATGG TGGGTTACGA CCCGATCGTG ACCACGTCGA ACATCGCGGC CTATCATGGT
GCGCTCGTGT ACGACATGCT TTTCGGCAAC GATGAAAACC AGATGCCGCA TCCACAAATG
GTCGGCGACT ACACGATATC CGAGGACAAG CTGACCTGGA CGATGACGCT GCGAGACGGT
CTGACCTTCT CCGACGGCAG CCCTGTGACG ACGGCCGACG TGATACCATC CATCCTGCGC
TGGCAGGCGC GTGCCAGCCA GAACGGCAAG CTGCTGGCGG CGGCTACGCA GGAGCTGGTG
GCCTTGGACG ACCGCACCTT CCAATTCAAG CTCAAGGAGC CGTTCCCGTT GTTGGCGGCA
ATGCTGGGCA GCCCGGCAAC GCCACTGTGC TTCATCATGC GCAAGCGCGA GGCGGAAATG
GATCCGGCTC AGGCGGTCGA CGTCTGCATC GGCTCCGGCC CCTATGTGCT CAACACGCAG
GAAACCCGTC CAGGTATCGA CTACGTTTAC GATCGTAATC CGAACTATGT GCCCCGCGAG
GAGCCCGCGA GCGGGCTTTC GGGCGCCAAG ATTGCGAATT TCGAGCGCGT CATTCTGGTG
AACATGCCGG ACGCTCAAAC TGCGATCGCG GCGCTGCAGG CCGGTGAGAT CGACTTCTAC
GAAATTCCGC CGATCGATTT CCTCCCCGTT CTGGAAAGCG ATTCAAATCT CAAGGTCGCC
GACATCATGA AGTCCGGCAC CGAGGGCTCC ATCATTCTCA ACTGGCTGCA GCCGCCCTTC
GATAACCTGA AGGTGCGGCA GGCCATGCTG TATGCCATTG ACCAGGAGGC GGTCCTGAAG
GGCCTCTTCG GCGATCCGAA CTGGTTTAAC GCGCATCCGA GCTGGTTCAC CTACGGTTCG
CCGCTTTACA ACGAAGCGAA CTCCGAGTGG TTCAGGATAG CGCCGGATCC GGAGAAGGCG
AAGCAACTGC TGGCCGAGGG CGGCTACGAC GGAACCCCGG TCGTGCTTCT CCAGGCGACC
GACCGCCAGG TCAATGCCGA CGCCGTGACA ATCATCGCGC AGCAGATGCG GGCGGCCGGC
TTCAACGTGC AGATCGACGC CATCGACTGG GCGACTCTGC TGCAGCGCAG GCCGAACAAG
GGACCGGTTT CGGAGGGCGG CTGGAATGCC TTCGTCTCCA CCTTCAACGG CTTTATAAGC
TCGAACCCCT ACACATTCGG TCACATGGCC ACGATCGGGG AGAACGGGTG GTTCGGATGG
CCTTCGGACG AACGCAACGA GGAACTGAAG GCCGCCTGGA TGAAGGCCGA AACGCCTGAG
GAGCGGGTGG CGATTGCGGC GGAAATCCAG GAGAACGCCT GGAATATCGT GCCGCGCGTA
TCCTACGGGC ATTGGGTGCA ACCGGTCGCA TATCGCAGCA ACCTGGACGG CTTCGTTAGC
ATTCCCGGCG TACTTGCCTT CTGGAACGTC AAGCGAGTCT GA
 
Protein sequence
MTLSRRGFLC RSAAAGALLS MPLKAFAQSG EMGMLRVAVY TDMVGYDPIV TTSNIAAYHG 
ALVYDMLFGN DENQMPHPQM VGDYTISEDK LTWTMTLRDG LTFSDGSPVT TADVIPSILR
WQARASQNGK LLAAATQELV ALDDRTFQFK LKEPFPLLAA MLGSPATPLC FIMRKREAEM
DPAQAVDVCI GSGPYVLNTQ ETRPGIDYVY DRNPNYVPRE EPASGLSGAK IANFERVILV
NMPDAQTAIA ALQAGEIDFY EIPPIDFLPV LESDSNLKVA DIMKSGTEGS IILNWLQPPF
DNLKVRQAML YAIDQEAVLK GLFGDPNWFN AHPSWFTYGS PLYNEANSEW FRIAPDPEKA
KQLLAEGGYD GTPVVLLQAT DRQVNADAVT IIAQQMRAAG FNVQIDAIDW ATLLQRRPNK
GPVSEGGWNA FVSTFNGFIS SNPYTFGHMA TIGENGWFGW PSDERNEELK AAWMKAETPE
ERVAIAAEIQ ENAWNIVPRV SYGHWVQPVA YRSNLDGFVS IPGVLAFWNV KRV