Gene Tgr7_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTgr7_1950 
Symbol 
ID7316339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. HL-EbGR7 
KingdomBacteria 
Replicon accessionNC_011901 
Strand
Start bp2071068 
End bp2072498 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content71% 
IMG OID643616843 
Productprotein of unknown function UPF0027 
Protein accessionYP_002514018 
Protein GI220935119 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00816316 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCCGA ATCGCTTCAC GAAACTCTCC GACACCGCCT GGCAGCTTGA GCCCACGGGC 
AAGATGCGCG TGCCGGCGAT CCTGTACGCC AGCGAGGCGT TGCTGCGGGA GATGGACGAC
AAGGTGGCCG AGCAGGCGAC CAACGTCGCC ACCCTGCCCG GCATCGTGCA GGCCAGCTAC
GCCATGCCCG ACGCCCACTG GGGCTACGGC TTTCCCATCG GCGGCGTGGC CGCCTTCGAC
GCAGACGCGG GCGGCGTGGT TTCGGCTGGC GGCGTAGGCT TCGACGTCTC CTGCGGCGTG
CGCACCCTGC ACACGGGGCT GACCCGGGAG GCGATCGAGA AGATCAAACC GGCCCTGGCC
GATGCCCTGT TCGAGTCCAT CCCGGCAGGA CTGGGCAGCA CAGGCTACAT CCACCTGCGG
GACCATCAGA TGACGGAGAT GCTGGCCGGC GGCGCGGTCT GGGCGGTGCA ACAGGGCTAC
GGCGAGGCGG CGGACCTGGA ACGCATCGAG GAACACGGCC GCATGGCCGG TGCCGATCCC
CACGCGGTCT CGGAGCAGGC GCGCAAGCGC CAGCGCAACG AGATGGGCAC CCTGGGTTCA
GGCAATCACT ATCTCGAGGT GCAGCACGTC ACCGAGATCT ACGATCCCGC CGTGGCCAAG
GTGTTCGGCC TGGCCGTGGG CCAGGTGGTG GTGAGCATCC ATTGCGGCTC CCGGGGCCTG
GGCCACCAGA TCGGCACCGA GTTCCTGCGC GAGATGGCGG TGGCGGCGAA CCGCCACGGC
ATCGAGCTGC CGGACCGGGA ACTGGCCTGC GCGCCCATCC GCTCGGAACT GGGCGAGCGC
TACCTGGGCG CCATGCGCTC GGCCATCAAC TGCGCGCTGG CCAACCGCCA GATCCTCACC
CACCTGACCC GGCGCGTGTT CGCGAAGGTC CTGCCCGAGG CGCGCCTGGA CCTGCTCTAC
GACGTCTCCC ACAACACCTG CAAGGTGGAG ACCCACAGCA TCGACGGCAG CCCTCGCCAG
CTCTACGTGC ACCGCAAGGG CGCCACCCGC GCCTTCGGCC CCGGCCACCC GGACCTGCCC
GACGCCCTGC GCCCGGTGGG CCAGCCGGTG CTGATCGGCG GCTCCATGGG CACGGCCTCC
TACATCCTGG TGGGCACCAA CGAGGGCGAA CGGCTGTCCT TCAACTCCGC CTGCCACGGC
GCGGGCCGGG CCATGAGCCG GCATGCCGCG ACCCGCCAGT GGCGCGGCCG CGCGCTGGTG
GATGAGCTGG CCGGGCGCGG CATCCTGATC CGCAGCCCCA GCCTGCGCGG CGTGGCCGAG
GAGGCGCCCG GGGCGTACAA GGACGTGAGC GAGGTGGTGA AGGCGACCCA CCAGGCGGGC
CTGGCGAGGA TGGTGGCGCG GGTGGAGCCG TTGGTGTGCA TCAAGGGGTA G
 
Protein sequence
MDPNRFTKLS DTAWQLEPTG KMRVPAILYA SEALLREMDD KVAEQATNVA TLPGIVQASY 
AMPDAHWGYG FPIGGVAAFD ADAGGVVSAG GVGFDVSCGV RTLHTGLTRE AIEKIKPALA
DALFESIPAG LGSTGYIHLR DHQMTEMLAG GAVWAVQQGY GEAADLERIE EHGRMAGADP
HAVSEQARKR QRNEMGTLGS GNHYLEVQHV TEIYDPAVAK VFGLAVGQVV VSIHCGSRGL
GHQIGTEFLR EMAVAANRHG IELPDRELAC APIRSELGER YLGAMRSAIN CALANRQILT
HLTRRVFAKV LPEARLDLLY DVSHNTCKVE THSIDGSPRQ LYVHRKGATR AFGPGHPDLP
DALRPVGQPV LIGGSMGTAS YILVGTNEGE RLSFNSACHG AGRAMSRHAA TRQWRGRALV
DELAGRGILI RSPSLRGVAE EAPGAYKDVS EVVKATHQAG LARMVARVEP LVCIKG