Gene Francci3_3824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3824 
Symbol 
ID3905572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4584238 
End bp4585665 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content73% 
IMG OID637881150 
Product4-aminobutyrate aminotransferase 
Protein accessionYP_482903 
Protein GI86742503 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID[TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0707619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0367563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTC CCAACCCGGC GGCGACCGGA ATCGGCCTGC CCGCCACCGG CGGCCCGCGG 
CTGCCCCAGG AGCCGCGTCT GCACACCGAG ATCCCCGGCC CGCGGTCCCG GGCACTCGCC
GCGCGCCGCG TCGGCGCCGT GGCCCGGGGG GTCGGTGAGA CCCTCCCGGT CTACGCGCAC
GCGGCGGGCG GCGGGGTCGT GGTCGACGTG GACGGCAACT CGCTCATCGA CTTCGGTTCG
GGGATCGCGG TGGTCAGCGT CGGCAACGCT GCGGACGGGG TCGTCGAGGC CGTGCGCGAG
CAGATCGGCC GGTTCACCCA CACCTGCTTC ATGGTGACGC CGTACGAGGG CTATATCGCC
GTCTGTGAGG CGCTGAACCG GCTGACGCCG GGTACCCATG AGAAACGCTC GGCATTGTTC
AACTCCGGAG CGGAGGCCGT CGAGAACGCC GTCAAGATCG CGCGCAGCGC GACCGGCCGG
GGCGCGGTCG TCGTCTTCGA GCACGCTTAC CATGGCCGGA CGAACCTGAC GATGGCGATG
ACCGCGAAGT CGATGCCCTA CAAGAGTGGG TTCGGGCCGT TCGCGCCCGA GGTCTACCGG
ATGCCGCTGG CGTATCCCTA CCGCTGGCCG ACCGGTCCCG ACCGCTGTGG GGAGGAGGCC
GCCGAGCGGG TGATCGAGCT CGTGCGGGAC GAGATCGGGG CCGCGTCCGT CGCCGCGATG
GTCATCGAGC CGATCCAGGG GGAGGGCGGT TTCATCGTCC CCGGGACCGG GTTCCTGCCC
CGCCTCGCCG AGTTCTGTAC GGTCGCGGGG ATCGTGTTCG TCGCGGACGA GGTGCAGACC
GGGTTCGCCC GGACCGGGAC GATGTTCGCC TGCGAGCACG AAGGCGTCGT CCCGGATCTC
ATCACCACCG CCAAGGGCAT CGCCGGCGGC CTGCCGCTCG CGGCCGTCAC CGGCCGTGCG
GAGATCATGG ACGCCCCCCA GGTCGGCGGG CTCGGCGGCA CCTACGGGGG GAACCCGGCG
GCCTGCGCCG CGGCTCTCGC CGCGATCGAC CTCATCGAGT CCGAGGACCT GGCCGGCCGG
GCCCGCCGGA TCGGCGAGAT CGCCCTGCCC CGCCTGCACG CGTTGCGCGA GCGGTACGAC
TTCGTCGGGG ATGTCCGGGG GCGGGGCGCG ATGGTCGCCC TGGAGCTGGT CCGGGGCGCC
GGCGACGACA GCCCGGACAA GGTCCTCACC GCCGCCGCGG CCGCCGCCTG CCATCGTCGC
GGCCTGATCG TGCTCACCGC GGGCACCTGG GGCAACGTGC TCCGGCTGCT GCCACCGCTG
GTGATCGAGG AGCCGTTGCT GCTGGCTGGC CTCGATCTCC TCGACGAGGC GTTCGCGGAG
CTCGCGGCGC GCCGTCAGCC TCCCTCCCCG GCCCGCCCGG AGGGGTGA
 
Protein sequence
MTIPNPAATG IGLPATGGPR LPQEPRLHTE IPGPRSRALA ARRVGAVARG VGETLPVYAH 
AAGGGVVVDV DGNSLIDFGS GIAVVSVGNA ADGVVEAVRE QIGRFTHTCF MVTPYEGYIA
VCEALNRLTP GTHEKRSALF NSGAEAVENA VKIARSATGR GAVVVFEHAY HGRTNLTMAM
TAKSMPYKSG FGPFAPEVYR MPLAYPYRWP TGPDRCGEEA AERVIELVRD EIGAASVAAM
VIEPIQGEGG FIVPGTGFLP RLAEFCTVAG IVFVADEVQT GFARTGTMFA CEHEGVVPDL
ITTAKGIAGG LPLAAVTGRA EIMDAPQVGG LGGTYGGNPA ACAAALAAID LIESEDLAGR
ARRIGEIALP RLHALRERYD FVGDVRGRGA MVALELVRGA GDDSPDKVLT AAAAAACHRR
GLIVLTAGTW GNVLRLLPPL VIEEPLLLAG LDLLDEAFAE LAARRQPPSP ARPEG