Gene Acry_2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_2094 
Symbol 
ID5161864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp2306806 
End bp2308746 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content61% 
IMG OID640554016 
ProductRhs element Vgr protein 
Protein accessionYP_001235212 
Protein GI148261085 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.730907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACTA ATATCGATCT GCTTGACCTC CAGACATCGC TTGGTAGCGG CGTCCTGACG 
CTGGTCGGGG TCAATGGCCA GGCGGCGCTC AGCCAGTGCT TCCGGTATAC GCTGTCGGTG
CGCGCGGGTG CCGGCGGGCT CGATCCGAAC GATCTTCTAT ACACGTCGGT CACCGTCGCG
ATCGGGTCCG ACGGGACGAA CCAGACCTAC ATCAATGGCC TCGTCTCGTC CGTGGTCCAG
AAGCCCGGCA ACGCGGCCGG TACCGCGATC GCCGCCGGCC TCGAACTCTG GGATCATGAA
CTGACGGTCG TTCCCGCCCT CTGGTTCCTC GGGCAGACAC TCGATTGCCG GATATTCCAG
AACAAGACCG CCGTTGCCAT CGTCAAGGAA GTTCTGGACG AATTCGAGAT TACTTCATAC
GAGCTGCCGA GTGGCGGGAC GAGCTATGAC TATACAGTCA TGTTCAATGA GACCTATCTG
GACTTCATAA ATCGCATCCT CGCGCGCGAT GGGCTTTTCT ATTATTTCGT CCACGAAAAG
TCCGGACACA AGTTCGTCGT CGCCAGCAGC TCGCAATCTC TCCCCAAGTT GGGAGCCGTC
AAATTCGCCG GGCAGACCGC GACCGAAGTC GGGGTTCATA CGCTCAGCCG CGCCGACAGC
ACGACCATCG GCAAGTTCAT CGGGAACGAC TACAACTATG AAACGGCATC CACCGGCCTC
GTCTCCGATG CGGATACCGT GCTCAAGGCG AAGGGGGCGG CAACCCGCAA ATTCTATCGC
TACCCGTCGG AAAATCCCGT CAAGGAGGCG ATCGGACAGC AGGTGCGCCA TCAGAACGAG
GCGGCCGAGG TTCGTGCCGG GTTGTTCGCC GGTGGCGGCA ACACCCCGTC GATGCTCGCT
CCCGGCAACG CCGTCACCAT CACCGGCGAT CCCTTCGGCA TCGGCGATTA CGTCATTGCG
GCGGCCGCGC TCAGCGTGAC CGATCACGCC GGCATCGGCG GAGGGACGGC CACCGTCGAT
GTCGCATTCA CCCTGTTCGA CGCATCCGTT CCCTGGCGGC CGGAACTCCT GCCGAAGCCG
CAGATCGCCG GCTTGCAGTC CGCCCTCGTC GTCGGGCCCT CGGGCGATGA AATTTACACG
AACAAGTACG GCCGGGTGAA GGTTCAGTTC AACTGGGACA CGCGCGGCAA GAAGGACGAG
AACTCGAGTT GCTGGGTTCG CGTGATCCAG CCCTGGGCCG GCGCCGGCTG GGGGTTCCAG
TTCCTTCCGC GCATCGGCCA GGAGGTCGCC ATCAGCTTCC TCGAGAGCGA CATCGACCGG
CCGGTCGTGA TCGGTTCCTT CTACAACTCC GGCCAGGTCA GCCTGTTCAA TCTCCCCGCC
GAGCAGAACA AGGCGGGATT CCGCTCCCGC TCAACCAAGA GTGGCGGGAC GTCGAACTAC
AGCGAATTCA GTATCGACGA TACCAAGGGT TCGGAGGTTG TCCTGCTGCA TGCGGAGCGG
GATTATACGG TCGAGGTCGA GCATGATGAA ACCCGCACGA TCGGCAACAA CCGCACGGTG
ACCGTCAAGA AGGACGAGGC GATCACCGTG GACGGCAACC AGACCGAGAC GGTGAAAGGA
AATCGTACCT TCGAGGTCAA GCAGGACCAT TCCGAGACGG TCGATGGCAA CCAGTCCGTC
ACGGTCAAGG GCAAGCAGAG CGTGTCGGTG CAGGGGCAGC AATCCGTATC CGTGACCGGG
GCCGTCACCT ATGAGTCGAT GGAGTCGATC ACGCTGAAAG TGGGCGGAAA CAGCATCAAG
ATCGACATGA CGGGCATCAC GCTCTCCGGC ACCCTCATCA AGATCAATGC CCAGGCCGAG
CTGTCGACGA GCGGCGCGAT TGCGCAGCAT TCCGGCTCGG GGATGCTGAA ACTCCAGGGC
GGGATCATCA TGGTGAACTG A
 
Protein sequence
MTTNIDLLDL QTSLGSGVLT LVGVNGQAAL SQCFRYTLSV RAGAGGLDPN DLLYTSVTVA 
IGSDGTNQTY INGLVSSVVQ KPGNAAGTAI AAGLELWDHE LTVVPALWFL GQTLDCRIFQ
NKTAVAIVKE VLDEFEITSY ELPSGGTSYD YTVMFNETYL DFINRILARD GLFYYFVHEK
SGHKFVVASS SQSLPKLGAV KFAGQTATEV GVHTLSRADS TTIGKFIGND YNYETASTGL
VSDADTVLKA KGAATRKFYR YPSENPVKEA IGQQVRHQNE AAEVRAGLFA GGGNTPSMLA
PGNAVTITGD PFGIGDYVIA AAALSVTDHA GIGGGTATVD VAFTLFDASV PWRPELLPKP
QIAGLQSALV VGPSGDEIYT NKYGRVKVQF NWDTRGKKDE NSSCWVRVIQ PWAGAGWGFQ
FLPRIGQEVA ISFLESDIDR PVVIGSFYNS GQVSLFNLPA EQNKAGFRSR STKSGGTSNY
SEFSIDDTKG SEVVLLHAER DYTVEVEHDE TRTIGNNRTV TVKKDEAITV DGNQTETVKG
NRTFEVKQDH SETVDGNQSV TVKGKQSVSV QGQQSVSVTG AVTYESMESI TLKVGGNSIK
IDMTGITLSG TLIKINAQAE LSTSGAIAQH SGSGMLKLQG GIIMVN