Gene Francci3_3174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3174 
SymbolargJ 
ID3903899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3761349 
End bp3762515 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content72% 
IMG OID637880498 
Productbifunctional ornithine acetyltransferase/N-acetylglutamate synthase protein 
Protein accessionYP_482260 
Protein GI86741860 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) 
TIGRFAM ID[TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.710796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.355777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCGTTA CCGCCCCACG GGGCTTCCGC GCCGCCGGAG TGGCGGCCGG GTTGAAGCCG 
TCCGGCCGTC CGGACGTGGC GCTCGTCGTC AACGACGGGC CGTCCGACGC CGCCGCCGGG
ATCTTCACCG CCAACCGGGT TCAGGCGGCT CCGGTGCTCT GGACCCGCCA GGTCCTCGCC
GACGGCCGGC TGCGCGCCGT CATCCTCAAC TCCGGCGGGG CGAACGCCTG CACCGGTCCG
GCCGGATTCG CCGATACCCA TGCCACGGCC GAGCACGTCG CCGGGGCGCT CGGCCTCGGT
GCCGGCGACG TCGCGGTCTG CTCGACCGGT CTCATCGGCG TCCGCCTGCC CATGGACAAC
CTGCTCGCCG GCGCCACCAA GGCGGTGGCG GACCTCGCTG ACACCGACAC CGCCGGCGCC
GCGGCTGCGG AGGCGATCCG CACCACCGAC ACCGTCGCGA AGGCCGCGGT CCGCACCTCG
TCAACGACCC CGGGTGTCAC CATCGGCGGG ATGGGTAAGG GCGCGGCGAT GCTGGCGCCG
TCGCTCGCGA CGATGCTCGT GGTGGTCACC ACCGACGCCG TCGCGGACGC CGCCACCCTC
GATCGGGTGA TCCGGGCGTC CTCGCGGGTC AGCTTCGAGC GGGTCGATTC CGACGGCTGC
CTGTCCACCA ACGACACCGT GCTGCTGCTG GCCTCGGGCG CCGCCGGGGT GAGCCTGCCC
GAGGCGGAGC TGACGGGGCT GGTCACCGAG GTGTGCATCG ACCTCGCCCA GCAGATGCTC
GGCGATGCCG AGGGTTCGAC GAAGACCATC GCCATCACCG TGACCGGAGC CGCGACCGAG
GACGACGGGC TGGAGGTCGG CCGGGCCGTC GCCCGCAACA ACCTGCTCAA GTGCGCTCTG
TACGGCAAGG ACCCGAACTG GGGACGGGTG CTCGCCGCGA TCGGCACCAC GAAGGCCGTC
TTCGAGCCCG ACGCGCTGGA CGTGTCCATC AACGAGGTGC GGGTCTGCCG GGCCGGGGCC
CCCGGCGAGG ACCGCGACCT CGTCGACCTG TCCGGCCGTG AGGTGCGCAT CGGCGTCGAT
CTGCACGCCG GCGATGCCGA GGTCACCGTC TGGACGAACG ACCTGACCGA CGGGTACGTC
TACGAGAACT CGGCGTACTC GACATGA
 
Protein sequence
MSVTAPRGFR AAGVAAGLKP SGRPDVALVV NDGPSDAAAG IFTANRVQAA PVLWTRQVLA 
DGRLRAVILN SGGANACTGP AGFADTHATA EHVAGALGLG AGDVAVCSTG LIGVRLPMDN
LLAGATKAVA DLADTDTAGA AAAEAIRTTD TVAKAAVRTS STTPGVTIGG MGKGAAMLAP
SLATMLVVVT TDAVADAATL DRVIRASSRV SFERVDSDGC LSTNDTVLLL ASGAAGVSLP
EAELTGLVTE VCIDLAQQML GDAEGSTKTI AITVTGAATE DDGLEVGRAV ARNNLLKCAL
YGKDPNWGRV LAAIGTTKAV FEPDALDVSI NEVRVCRAGA PGEDRDLVDL SGREVRIGVD
LHAGDAEVTV WTNDLTDGYV YENSAYST