Gene Gura_3641 details

Gene Information

Locus tag: Gura_3641
Symbol:
ID: 5164254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4270136
End bp: 4271260
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 59%
IMG OID: 640551125
Product: leucyl aminopeptidase (aminopeptidase T)-like protein
Protein accession: YP_001232367
Protein GI: 148265661
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
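
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4270136
end_bp = 4271260

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints "1125 0 374"
```

Under these assumptions the computed values match the reported Gene Length (1125 bp) and Protein Length (374 aa).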


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000266351
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTACAGCA AAGCCTTCTC GGATCTTTTC AGCATCAACA TGGGGGTAAA AAGCGGGGAG 
CGCATCCTGG TCTTCAGCGA TACGATCCGC CCCGACGAAA CGCCATCTGC GGCTGATGCG
GACCGCCGCG CCAGGCTGTT GCAGACCGCT GCTGATGCGG CTGCCTATGC CGGGAAGATA
TACGGCAACA CCACTTTCAT CTCGTTCCCC GCCACCACCG CCTCCGGCGC TGAACCGCCG
GAAGCTCTCT GGCGAGCAGC CTTGGGTGAC AGTGCTGCCG ACAAGCTCGT GGAGGCTGGC
ATCCTGCCAC GGCTTCTATC CAAGGAGGCC ACCCCGGAAG AAGTCGAACG AGCCAGAGAA
ATCGTCATTA TGGGAAAAGG AGCCGTTGCC GATGTGGTAA TTGCTCTCGC CAACAATTCC
ACCAGCCACA CACGCTTCCG CTCCCTGATA AACGCCGCCG GCGGACGCTT CGCCAGCCTT
CCCCACTTCG ACCCAGCGAT GTTTTTCACC TCCATGCAAG TCGACTGGCA GGCCCTTGTC
GAACGAACCG CCAAGCTCGC CGGAGAAATA AACGGTGCCG TGGAAATCGA AGTGACCACC
CCCAACGGCA GCCGGATGCG TATCGGCAAG CAGGGAAGGA TTGCCGAAGG GGACGACGGC
CTTTTAACCG CGCCGGGTAG CTTTGGCAAC CTGCCGGCCG GCGAAGTCTA TCTGGCGCCG
CTGGAAGGAA CATGTGAAGG AATAATGGTT CTGGAGTATG CGCCAAACCG CAAACTCGTC
TCGCCGATTG AGCTTGTAGT TAAGAACGGG ATTGTCACCG AGATTCGCGG CGATGAACCG
TACAGACATA AACTGGAGCA GAAATTCGCT GAGAGCGCAA AAAATCGGAA TATCGCCGAG
CTGGGAATCG GCACCAACGA CAAGGCGAGC AGGCCGGACA ACATCCTCGA AGCGGAAAAA
ATTCTCGGCA CCATCCATAT TGCCCTGGGG GACAATTCCG GCTTCGGCGG CACCGTCAGC
ACCCCGTTCC ACGAGGACTA CGTGTTTTAC GAGCCGACGC TGACCGCCAT CATGGCCGAC
GGGACGGAGA AAATCCTGCT GCGCCAAGGG CAACTTACTA TCTGA
 
Protein sequence
MYSKAFSDLF SINMGVKSGE RILVFSDTIR PDETPSAADA DRRARLLQTA ADAAAYAGKI 
YGNTTFISFP ATTASGAEPP EALWRAALGD SAADKLVEAG ILPRLLSKEA TPEEVERARE
IVIMGKGAVA DVVIALANNS TSHTRFRSLI NAAGGRFASL PHFDPAMFFT SMQVDWQALV
ERTAKLAGEI NGAVEIEVTT PNGSRMRIGK QGRIAEGDDG LLTAPGSFGN LPAGEVYLAP
LEGTCEGIMV LEYAPNRKLV SPIELVVKNG IVTEIRGDEP YRHKLEQKFA ESAKNRNIAE
LGIGTNDKAS RPDNILEAEK ILGTIHIALG DNSGFGGTVS TPFHEDYVFY EPTLTAIMAD
GTEKILLRQG QLTI