Gene Xaut_4690 details


Gene Information

Locus tag: Xaut_4690
Symbol:
ID: 5421176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xanthobacter autotrophicus Py2
Kingdom: Bacteria
Replicon accession: NC_009720
Strand:
Start bp: 5197749
End bp: 5200748
Gene Length: 3000 bp
Protein Length: 999 aa
Translation table: 11
GC content: 65%
IMG OID: 640883953
Product: type III restriction protein res subunit
Protein accession: YP_001419566
Protein GI: 154248608
COG category:
COG ID:
TIGRFAM ID:
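
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 5197749
end_bp = 5200748

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 3000 bp
print(protein_length, "aa")  # 999 aa
```

Under these assumptions the result agrees with the listed Gene Length (3000 bp) and Protein Length (999 aa).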


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.553282
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.43699
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCAGCT TTTACGAGCG GCCGATTCTG AACTCGCCAT ATTGCGTGCC GGGATTGCAC 
CATCCACTCG ACAAGAATGG CCAGCCGATC GACGGCGAGC CGAGGCAAGG CCGCCGCCCC
TCCCGGCACA TCGTGCCCGT CCCGGCATCG CGGAAGAAAG CGTCGGCCGG TCAGGCCTCG
CTCGATCTGG AAACCTACAG CGACTACCCC CTTATCAACG AGATTCGTAG TTTCGTCGAC
ACATGGCGCG CGCTGCCCAA CCCTGCCGAC TGGGGGGTGA CAGGGACGAC GCAGCGGCTT
CTGGAACACT GGCGGCATGG GCAGTTTTCA GGCCCGCAGC CGTTCTTCTG CCAGGTTGAA
GCGGCCGAAA CCATCATCTG GCTGACCGAG GTCGCCCCGA AACGCACCGC CACCAGGCGC
CTGCGCGACG AACTGGAACA GCACAATAAG GGAGCCAATC CCGCGCTCTT TCGTCTTGCC
ATGAAGATGG CGACCGGCAG CGGCAAGACC ACCGTCATGG CCATGCTGAT CGCCTGGCAG
GCGGTGAATG CTGCGCGCAA GGCAACGAAG GATTTCTCGC GCGCCTTCCT GATTATCGCC
CCGGGCATCA CCATCCGTGA CCGCCTGCGT GTGCTGCTTC CCAGCGAGCC GGACAATTAT
TACGAGACGC GCGAGATCGT GCCGCCGGAG ATGCTGCCGG ATATCCGGCG CGCGGAAATC
GTCATTACCA ACTACCACGC GTTCCAGCAC CGCGAGACGC TGGCCTTGCC CAAGGTCGCC
CGCAGCTTCC TGCAGGGCAA CGCTCCGGAG CCGTTAAAAA CCACGGAAAC CGACGCCGAG
ATGCTCGAAA GAGCATGCGG CAAGCTGCTC AATTACGATC GCGTCACGGT CATCAACGAC
GAGGCGCACC ATTGCTACCG CCACAAGGTG GGCGGCGACG CCGAAGGTCC GCTCACGGGC
GAGGACAAGA AGGAGGCGGC CGAGAACGAA GAGGCGGCGC GGCTCTGGAT CAACGGCATC
GAGGCGCTCG ACCGCAGGCT TTCGAAGGGG GTGCGCGCGG TCTATGACCT CTCCGCCACG
CCTTTCTTCC TGCGCGGCTC GGGCTATCAG GAGGGGTATC TCTTCCCCTG GGTGGTGTCC
GATTTCAGCC TGATGGATGC CATCGAGAGT GGCATCGTCA AGCTGCCCCG CGTGCCTGTG
ACCGACAATC TCGTGCAGAC CGATAGGGTG GTCTACCGCG ACTTGTGGAA GTCGATCGGC
AAATCGCTGC CGAAGACCGC CGCCGGCGCC GCCAAGATCA GCCCGTTCGA TCTGCCCCCG
ACGCTGCTGA CCGCGCTGAC CGCGCTCTAC AGCCATTACG AGGGCGAGTT CAAGCGGTGG
GAGCGGGCCG GGATCGGGGT GCCGCCGGTG TTCATCGTGG TGTGCCAGAA CACGGCCATT
TCCAAGCTGG TGTTCGAATG GATCGCCGGC TTCGAGCGCG GCGATGCCAA TGAGGGCGAG
CGCGCGGCCT TTCATGCGGG CCACCTCGAG CTGTTCCGCA ATTATGACGA ACACGGCGCC
CGCCTTCCCC GCCCGCGCAC GCTCCTGATC GATTCCCGCG AGATCGAATC CGGCAAGGCG
CTGGACAAGG GGTTCCGCGA CGCCGCCGGC CCGGAGATCG AGCAGTTCAA GCGCGAGCTG
GCCAGCCGCG AGGGCGCCGG CAGCGTGAAC GGCGACGTGA GCGAGGGCGA GCTGCTGCGC
GAGGTGATGA ACACCGTGGG CCGCCCCGGC CGCCTCGGCG AGCAGATCCG CTGCGTGGTC
TCGGTGTCCA TGCTGACCGA GGGGTGGGAC ACCAACACCG TCACCCACAT CCTCGGGGTG
CGCGCCTTCG GCACCCAGCT CCTGTGCGAG CAGGTGGTGG GCCGTGCCCT GCGCCGGCAA
TCCTATGACC TGAACCGCGA GATCGGCCTG TTCGACGTGG AATATGCCGA CATCATTGGC
ATTCCGTTCG ATTTCGCCGC CTCGCCGCAG AAGGCCGAGC CGGTGAAGCC GAAGCCGGTC
ACGCGCGTCC ATGCCATCAA GGAGCGGGCG GGGCTGGAGA TCGCCTTTCC CCGTGTCAGC
GGCTATCGGC GGGAGCTGCC TTCGGAAAAG CTGGAGGCGG TGTTCTCCGA CGACAGCCGG
CTGGAGATCA CGCCGCTGGA CATCGGCCCC ACGGCGGTGG TCATGGAGGG CATCGTCGGC
GCCGGCGTCA CCATCACGCC CGAGGTGCTG GACCGGCTGC GGCCGTCCGA GATCAGCTTC
AACCTCGCCA AGCACCTGCT CTATTCGCAT TTCCGCGACG AGGAGGGCTT TCCCAAGCAG
CACCTGTTCC CGCAGATCCA GCGTCTTGCG CGCCGCTGGA TCGACGAGGG CTATCTGGTG
ACGAAGGGCG TGCCCATCGG CGCCATCCTC TATCAGGACC AGCTGGCCCG CGCGGCGGAG
AAGATCGACA TTGCCCTCAC TCGCGGCAGC GAGGGGCAGA TCATGGCGGT GCTTGATCCC
TACAATCCGA AGGGGTCCAC CCGCCACGTC AACTTCATCA CCTCGAAGCC CTGCTGGAAG
ACCGGCGCGC AGCCGCCCAA GTGCCAGATC AGCCATGTGG TGCTGGATTC CAGCTGGGAG
GAACAGCTGG CCCTCACGCT GGAGACTCAT CCGCGTGTGA TCGCCTATGC CAAGAACCAG
GCGCTGGGCT TCGAGATCCC CTATCTCGAT GGCGGCACCA TGCGGCGCTA TGTGCCCGAT
TTCCTCGTGC GGCTGGATGA CGGCGGCACC ACGCCGCTCC ATCTGGTGCT GGAGGTGAAG
GGCCTGCGCG ACGAGGCCGA CAAGGCCAAG GCGCAGACGA CCCGCGACCT GTGGGTGCCC
GGCGTCAACG CCCTCGGCGG CTTCGGCCGC TGGGACTTCG CCGAATTCCG TGACTGGACG
ACCATGACCG AAGACTTCGC CGCGCTGGTT GAACGACTGC TCAAGAAGGT TGCCGCCTGA
 
Protein sequence
MTSFYERPIL NSPYCVPGLH HPLDKNGQPI DGEPRQGRRP SRHIVPVPAS RKKASAGQAS 
LDLETYSDYP LINEIRSFVD TWRALPNPAD WGVTGTTQRL LEHWRHGQFS GPQPFFCQVE
AAETIIWLTE VAPKRTATRR LRDELEQHNK GANPALFRLA MKMATGSGKT TVMAMLIAWQ
AVNAARKATK DFSRAFLIIA PGITIRDRLR VLLPSEPDNY YETREIVPPE MLPDIRRAEI
VITNYHAFQH RETLALPKVA RSFLQGNAPE PLKTTETDAE MLERACGKLL NYDRVTVIND
EAHHCYRHKV GGDAEGPLTG EDKKEAAENE EAARLWINGI EALDRRLSKG VRAVYDLSAT
PFFLRGSGYQ EGYLFPWVVS DFSLMDAIES GIVKLPRVPV TDNLVQTDRV VYRDLWKSIG
KSLPKTAAGA AKISPFDLPP TLLTALTALY SHYEGEFKRW ERAGIGVPPV FIVVCQNTAI
SKLVFEWIAG FERGDANEGE RAAFHAGHLE LFRNYDEHGA RLPRPRTLLI DSREIESGKA
LDKGFRDAAG PEIEQFKREL ASREGAGSVN GDVSEGELLR EVMNTVGRPG RLGEQIRCVV
SVSMLTEGWD TNTVTHILGV RAFGTQLLCE QVVGRALRRQ SYDLNREIGL FDVEYADIIG
IPFDFAASPQ KAEPVKPKPV TRVHAIKERA GLEIAFPRVS GYRRELPSEK LEAVFSDDSR
LEITPLDIGP TAVVMEGIVG AGVTITPEVL DRLRPSEISF NLAKHLLYSH FRDEEGFPKQ
HLFPQIQRLA RRWIDEGYLV TKGVPIGAIL YQDQLARAAE KIDIALTRGS EGQIMAVLDP
YNPKGSTRHV NFITSKPCWK TGAQPPKCQI SHVVLDSSWE EQLALTLETH PRVIAYAKNQ
ALGFEIPYLD GGTMRRYVPD FLVRLDDGGT TPLHLVLEVK GLRDEADKAK AQTTRDLWVP
GVNALGGFGR WDFAEFRDWT TMTEDFAALV ERLLKKVAA
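
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that behavior, not the pipeline that produced this record: the helper `translate_cds` and the constant names are hypothetical, the codon table is built by hand from the standard assignments (which table 11 shares), and only the first 60 bases of the gene are translated to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs in its set of start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading a valid first codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS_11:
            protein.append("M")         # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":              # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "TTGACCAGCT TTTACGAGCG GCCGATTCTG AACTCGCCAT ATTGCGTGCC GGGATTGCAC"
print(translate_cds(first_line))  # MTSFYERPILNSPYCVPGLH
```

The output matches the first 20 residues of the protein sequence. Note that the second TTG in this fragment is internal and is read as leucine (L), as usual.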