Gene Franean1_0949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0949 
Symbol 
ID5669363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1111392 
End bp1112813 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content71% 
IMG OID641239877 
Productaminotransferase class-III 
Protein accessionYP_001505311 
Protein GI158312803 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.762342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.330894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGTGT CCAACGCGAC GACCACGACT CCTCGTCCCG GTGACCCGCC GCGGACTGGG 
CACGCCGGGC GGGTGACCGA TCTCCTCGAG CGGGAGGAGC GCGCGCTGCA GGCCCGCACT
CCGGCGTCCG AGGCCATGCA CACCCGTGCC CTGCGGACGA TGACCGGCGG GGTCCCGTCC
TCCTACCAGC TGCGCGACCC CTGGCCGATC TACCTCACGC ACGGCCGCGG CTCGCTGGTC
TGGGACGTCG ACGGCAACGA GTACTCCGAC TTCCACAACG GGTACGGCTC GATGGTTCAG
GGCCACGCCC ACCCCGCGAT CGTGCGCGCG GTGACCGAGC GGATGGCGCT CGGCTCGCAC
TTCGCCATGC CCACCGAGGA TTCGGTGCTG GTCAGCGAGG AGCTGGCCCG CCGCTTCGGG
CTGCCGCAGT GGCGTTACGT CAACTCCGGC TCCGAGGCGA CCATGGACGC CATCCGCATC
GCCCGCGGGG TGACCGGCCG GGACACCATC GTCAAGATCT TCGGCTCGTA CCACGGCCAT
CACGACTACG TGATGGTCTC GATCGGCACC CCCTACGGGG ACATCGGGCC GGCGGACCAT
ATGAACTCCC TGGCCTACGG CGCGGGAATC CCGCAGGCGG TGGTCGACCT GACGGTGCCC
GTGCCGTTCA ACGACGCGGC GGCGATGGAA CGGCGGATCG CCGCGCTCGA GGCCGAGGGG
CGCAAGCCCG CCTGTGTGAT CATGGAGGCG GCGATGATGA ACCTCGGCGT CGTCCTGCCC
GAGCCGGGCT ACCTGGAGGC CGTCCGGGAG ATCACACGCA GGCACGGCAT CGTGCTCATC
TTCGACGAGG TCAAGACCGG GCTGTGCGTC GCGGCCGGCG GCGCCGTCGA GCGGTTCGGG
GTGCTGCCCG ACATGGTCAC CCTCGCCAAG GCGCTCGGCG GCGGACTGCC GGCCGGCGCG
ATCGGCGCCA CCGCGGAGCT GATGGCCGCG GTCGCCGAGG ACAGGGTGAA ACAGGTGGGG
ACCTTCAACG GCAACCCGCT GGTCATGGCG GCGGCGCGGG CCAGCCTGAC CGAGGTCCTG
ACCCCGGACG CCTACGCCCA CCTCGACCGC CTCAACGACC GCCTGGTGGA CGGCTGCACC
GCGATCCTGG CCCGCCACGG CATCGCCGGC TACGCCGTCG GGATCAGCTC GAAGGGATGC
GTCCACTTCA CCGACGCCCC GATCCGGGAC TACACGTCCT TCATGGCCCA CCAGAACGCG
GTCCTGCCGG AGCTGGCCTG GCTCTACAAC GCCAACCGGC AGGTGCTGAT GGCCCCCGGC
CGGGAGGAGG AGTGGACGCT GTCCGTCCAG CACACCGACG CCGACGTCGA CCGCTACCTG
GCGAGCCTCG ACGCGATGGC CGCGGACCTC GCCCGCGGCT GA
 
Protein sequence
MTVSNATTTT PRPGDPPRTG HAGRVTDLLE REERALQART PASEAMHTRA LRTMTGGVPS 
SYQLRDPWPI YLTHGRGSLV WDVDGNEYSD FHNGYGSMVQ GHAHPAIVRA VTERMALGSH
FAMPTEDSVL VSEELARRFG LPQWRYVNSG SEATMDAIRI ARGVTGRDTI VKIFGSYHGH
HDYVMVSIGT PYGDIGPADH MNSLAYGAGI PQAVVDLTVP VPFNDAAAME RRIAALEAEG
RKPACVIMEA AMMNLGVVLP EPGYLEAVRE ITRRHGIVLI FDEVKTGLCV AAGGAVERFG
VLPDMVTLAK ALGGGLPAGA IGATAELMAA VAEDRVKQVG TFNGNPLVMA AARASLTEVL
TPDAYAHLDR LNDRLVDGCT AILARHGIAG YAVGISSKGC VHFTDAPIRD YTSFMAHQNA
VLPELAWLYN ANRQVLMAPG REEEWTLSVQ HTDADVDRYL ASLDAMAADL ARG