Gene CPF_1772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1772 
SymbolilvE 
ID4202013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1997463 
End bp1998488 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content34% 
IMG OID638082644 
Productbranched-chain amino acid aminotransferase 
Protein accessionYP_696208 
Protein GI110800346 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 
TIGRFAM ID[TIGR01123] branched-chain amino acid aminotransferase, group II 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0397129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAA AGACAGCTAT TGATTGGAAT AATCTAGGCT TCTCTTATAT GAAAACAGAT 
TATCGTTACA TATCTCACTA TAAAGATGGT AAATGGGATG AAGGAAAATT AGTTACAGAC
AACAAATTAA GCATAAGTGA AGCTTCAACT GCCCTTCACT ATGGCCAACA ATGTTTTGAA
GGTTTAAAAG CTTATAGAAC AAAGGATGGA AAGATTCAAC TTTTTAGAGT AGATGAAAAT
GCTAAGAGAA TGAATAAATC ATGTGATAAA CTTTTAATGC CTGAAATACC AGTTGAAAAA
TTCATAGATG CTTGTATGCA AGTTGTCAAG GCTAATGAAA GATTTGTACC TCCATACGGT
ACTGGTGCAA CTCTTTATAT AAGACCTTTC ATGATAGGTG TTGGTGATAA TATAGGTGTT
AAATCTGCTC CTGAATTTAT ATTTTCAGTA TTCTGCCTTC CAGTTGGTGC TTATTTTAAA
GGTGGAATGA AGCCTGTAAA CTTTATGATT GCAGATTATG ATAGAGCTGC TCCTAAAGGA
ACTGGTGCCG CTAAAGTTGG TGGAAATTAC GCAGCAAGCT TAAAGGCTCA TGAAATAGCT
GCAAAAAAAG GATTTGCTGA TTGTATATAT TTAGACCCAG CAACTCACAC TAAAATTGAG
GAAGTTGGAG CTGCAAACTT CTTTGGAATA ACAAAGAAAG GTGAGTTTGT TACTCCATAT
TCAGAATCAA TTTTACCAAG TATAACAAAA TACTCTTTAA TGCAAATAGC TAAAGATTAT
TTAAAAATGC CTGTATCAGA AAGAGATGTT TTAATAGATA ACTTAGATGA ATTCGCTGAG
GCTGGCGCTT GTGGTACAGC CGCTGTAATA ACTCCAATAG GAGGAATAGA ATATAAGAAT
AAACTTCATG TTTTCCATAG CGAAACTGAA GTTGGTCCTA TTACTAAAAA ACTTTATGAT
CTTTTATCTG GAATGCAATT TGGAGATGTA GAAGCTCCTG AAGGATGGAT ATTTGAAGTT
AAATAA
 
Protein sequence
MDKKTAIDWN NLGFSYMKTD YRYISHYKDG KWDEGKLVTD NKLSISEAST ALHYGQQCFE 
GLKAYRTKDG KIQLFRVDEN AKRMNKSCDK LLMPEIPVEK FIDACMQVVK ANERFVPPYG
TGATLYIRPF MIGVGDNIGV KSAPEFIFSV FCLPVGAYFK GGMKPVNFMI ADYDRAAPKG
TGAAKVGGNY AASLKAHEIA AKKGFADCIY LDPATHTKIE EVGAANFFGI TKKGEFVTPY
SESILPSITK YSLMQIAKDY LKMPVSERDV LIDNLDEFAE AGACGTAAVI TPIGGIEYKN
KLHVFHSETE VGPITKKLYD LLSGMQFGDV EAPEGWIFEV K